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PREFACE TO DOVER EDITION 


We are delighted that the book has been reprinted by Dover because it is still 
used at many places, for instance at Lund University and at courses for indus- 
trial audiences. We also regularly obtained requests from colleagues all over 
the world, who are using the book, and it is a real pleasure to tell them that 
the book is available in print again. Even if the popularity of adaptive con- 
trol has decreased somewhat in the international conferences it is interesting 
to see that the number of applications are increasing. Systems for adaptive 
cruise control and adaptive optics are typical examples. Adaptation is also an 
important mechanism in many biological systems. 

This edition is based on 1995 Addison-Wesley Edition, we have corrected 
some errors that are known to us and we have replaced a few figures. A 
Solutions Manual for the book is also available for instructors. The particular 
methods for automatic tuning, gain scheduling and continuous adaptation are 
still highly relevant as are the model reference adaptive controller and the 
self-tuner. The simulation too] used in the book is Simnon, but the simulation 
code and examples can easily be converted to other simulation packages. 

A specific feature of the book is the mixture between theoretical discussions 
and practical aspects of adaptive control. This makes the book useful for courses 
as well as for engineers implementing adaptive control systems. 

The specific products we described are still in production today, many of 
them have been upgraded. Practically all PID controllers developed today are 
using some form of automatic tuning. Use of gain scheduling has increased 
dramatically. Foxboro’s rule-based autotuner is still available but Foxboro has 
also introduced a model-based tuner. Relay auto-tuning has proved to be a very 
robust technique that is easy to use. Apart from the applications mentioned in 
the book it is also used in Delta V from Emerson. Adaptive techniques are also 
starting to be widely used in the automotive industry. A typical application is 
to replace calibration based on extensive tests by some form of adaptation. The 
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so called adaptive cruise control is, however, not an adaptive system but an 
ordinary feedback system. 

Minimum variance control has had a renaissance in the form of the Harris 
index, which is used as a basis for judging the quality of regulation in process 
control. It is a small step to apply the self-tuning regulator to make sure that 
good regulation is maintained continuously. The distributed control systems 
described have been updated but self-tuning is still available. The dialysis 
system from Gambro is today using gain-scheduling after a redesign of the flow 
system. Adaptive techniques also have a role to make many control systems 
easier to use. With increasing use of autonomy it can be expected that adaptive 
techniques will continue to be used and a basic knowledge of adaptive control 
is useful for engineers. 

We would like to thank John Grafton for a smooth collaboration with this 
edition. Very special thanks are due to Leif Andersson, who recovered an old 
TeX-version of the manuscript and made it work. 


Lund January 1, 2008 


Karl Johan Åström 
Björn Wittenmark 


Department of Automatic Control 
Lund Institute of Technology 
Box 118, 5-221 00 Lund, Sweden 


karl]_johan.astrom@control.1th.se 
bjorn.wittenmark@control.1th.se 


PREFACE TO 
ADDISON-WESLEY EDITION 


Adaptive control is a fascinating field for study and research. It is also of 
increasing practical importance, since adaptive techniques are being used more 
and more in industrial control systems. However, there are still many unsolved 
theoretical and practical issues. 


» Goal of the book Our goal is to give an introduction and an overview of the 
theoretical and practical aspects of adaptive control. 

Since knowledge about adaptive techniques is widely scattered in the lit- 
erature, it is difficult for a newcomer to get a good grasp of the field. In the 
book we introduce the basic ideas of adaptive control and compare different 
approaches. Practical aspects such as implementation and applications are 
presented in depth. These are very important for the understanding of the 
advantages and shortcomings of adaptive control. This book has evolved from 
many years of research and teaching in the field. 

After learning the materia] in the book a reader should have a good per- 
spective of adaptive techniques, an active knowledge of the key approaches, 
and a good sense of when adaptive techniques can be used and when other 
methods are more appropriate. 


« The new edition Adaptive control is a dynamic field of research and indus- 
trial applications. Much new knowledge has appeared which by itself motivates 
a new edition. 

We have used the first edition of the book to teach to a wide variety 
of audiences, in regular university courses, courses to engineers in industry, 
and short courses at conferences. These experiences combined with advances 
in research, have shaped the major revisions made in the new edition. We 
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have also benefited from feedback of students and colleagues in industry and 
universities, who have used the first edition of the book. 

New chapters have been added, and the material has been reorganized. 
Most of the chapters have been substantially revised. In the revision we have 
also given more emphasis to the connection between different design methods 
in adaptive control. There is a major change in the way we deal with the theory. 
In the first edition we relied on mathematics from a wide variety of sources. 
In the new edition we have to a large extent developed the results from first 
principles, To make this possible we have made stronger assumptions in a few 
cases, but the material is now much easier to teach. The reorganization of the 
material also makes it easier to use the book for different audiences. 

The first edition had two introductory chapters; they have now been com- 
pressed to one. In the first edition we started with model-reference adaptive 
systems following the historical tradition. In the second edition we start with 
parameter estimation and the self-tuning regulator. This has several advan- 
tages, one is that students can start to simulate and experiment with computer- 
based adaptive control at a much earlier state, another is that system identi- 
fication gives the natural background and the key concepts required to under- 
stand many aspects of adaptive control. 

The material on self-tuning control has been expanded substantially by 
introducing an extra chapter. This has made it possible to give a strict separa- 
tion between deterministic and stochastic self-tuners. This is advantageous in 
courses which are restricted to the deterministic case. 

The chapter on model-reference adaptive control has been expanded sub- 
stantially. The key results on stability theory are now derived from first prin- 
ciples. This makes it much easier to teach to students who lack a background 
in stability theory. A new section on adaptive control of nonlinear systems has 
also been added. 

The reorganization makes the transformation from algorithms to theory 
much smocther. The chapter on theory now naturally follows the development 
of nonlinear stability theory. The presentation of the theory has been modified 
substantially. A new section on stability of time-varying systems has been 
added. This makes it possible to get a much better understanding of adaptation 
of feedforward gains. It also is a good transition to the nonlinear case. Material 
on the nonlinear behavior of adaptive systems has also been added. This adds 
substantially to the understanding of the behavior of adaptive systems. 

The chapter on practical aspects and implementation has been rewritten 
completely to reflect the increased experience of practical use of adaptive con- 
trol. It has been very rewarding to observe the drastically increased industrial 
use of adaptive control. This has influenced the revision of the chapter on appli- 
cations. For example, adaptive control is now used extensively in automobiles. 

Many examples and simulations are given throughout the book to illustrate 
ideas and theory. Numerous problems are also given. There are theoretical 
problems as well as problems in which computers must be used for analysis 
and simulations. The examples and problems give the reader good insight into 
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properties, design procedures, and implementation of adaptive controllers. To 
maintain a reasonable size of the book we have also done careful pruning. 

To summarize, new research and new experiences have made it possible 
to present the field of adaptive control in what we hope is a better way. 


Outline of the Book 


» Background Material The first chapter gives a broad presentation of adap- 
tive control and background for its use. Real-time estimation, which is an es- 
sential part of adaptive control, is introduced in Chapter 2. Both discrete-time 
and continuous-time estimation are covered. 


« Self-tuning Regulators and Model-reference Adaptive Systems Chapters 3, 4, 
and 5 give two basic developments of adaptive control: self-tuning regulators 
(STR) and model-reference adaptive systems (MRAS). Today we do not make 
a distinction between these two approaches, since they are actually equivalent. 
We have tried to follow the historical development by mainly treating MRAS 
in continuous time and STR in discrete time. By doing so it is possible to cover 
many aspects of adaptive regulators. These chapters mainly cover the ideas and 
basic properties of the controllers. They also serve as a source of algorithms 
for adaptive control. 


e Theory Chapter 6 gives deeper coverage of the theory of adaptive con- 
trol. Questions such as stability, convergence, and robustness are discussed. 
Stochastic adaptive control is treated in Chapter 7. Depending on the back- 
ground of the students, some of the material in Chapters 6 and 7 can be 
omitted in an introductory course. 


« Broadening the View Automatic tuning of regulators, which is rapidly gain- 
ing industrial acceptance, is presented in Chapter 8. Gain scheduling is dis- 
cussed in Chapter 9. Even though adaptive controllers are very useful tools, 
they are not the only ways to deal with systems that have varying parameters. 
Since we believe that it is useful for an engineer to have several ways of solving 
a problem, alternatives to adaptive control are also included. Robust high-gain 
control and self-oscillating controllers are presented in Chapter 10. 


e Practical Aspects and Applications Chapter 11 gives suggestions for the 
implementation of adaptive controllers. The guidelines are based on practical 
experience in using adaptive controllers on real processes. Chapter 12 is a sum- 
mary of applications and description of some commercial adaptive controllers. 
The applications show that adaptive control can be used in many different types 
of processes, but also that all applications have special features that must be 
considered to obtain a good control system. 

«Perspectives Finally, Chapter 13 contains a brief review of some areas closely 
related to adaptive control that we have not been able to cover in the book. 
Connections to adaptive signal processing, expert systems, and neural networks 
are given. 
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Prerequisites 


The book is for a course at the graduate level for engineering majors. It is 
assumed that the reader already has good knowledge in automatic control and 
a basic knowledge in sampled data systems. At our university the course can be 
taken after an introductory course in feedback control and a course in digital 
control. The intent is also that the book should be useful for. an industrial 
audience. 


Course Configurations 


The book has been organized so that it can be used in different ways. An 
introductory course in adaptive control could cover Chapters 1, 2, 3, 4, 5, 8, 11, 
12, and 13. A more advanced course might include all chapters in the book. A 
course for an industrial audience could contain Chapters 1, parts of Chapters 2, 
3, 4, and 5, and Chapters 8, 9, 11, and 12. To get the full benefit of a course, it 
is important to supplement lectures with problem-solving sessions, simulation 
exercises, and laboratory experiments. 


Simulation Tools 


Computer simulation is an indispensible tool for understanding the behavior 
of adaptive systems. Most of the simulations in the book are done by using the 
interactive simulation package Simnon, which has been developed at our de- 
partment. Simnon is available for IBM-PC compatible computers and also for 
several workstations and mainframe computers. Further information can be ob- 
tained from SSPA Systems, Box 24001, S-400 22 Göteborg, Sweden, e-mail: sim- 
non@sspa.se. The macros used in the simulations are available for anonymous 
FTP from ftp.control.1th.se, directory /pub/books/adaptive_control. 
Adaptive systems can of course also be simulated using other tools. 


Supplements 


Complete solutions are available from your sales representative. Course lec- 
tures, lab exercises, homework projects, final project, and copies of transparen- 
cies are available on the World Wide Web at http://www.control.lth.se, 
click on Education/Adaptive control. 


Wanted: Feedback 


As teachers and researchers in automatic control, we know the importance 
of feedback. We therefore encourage all readers to write to us about errors, 
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misunderstandings, suggestions for improvements, and also about what may 
be valuable in the material we have presented. 
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CHAPTER 1 


WHAT IS 
ADAPTIVE CONTROL? 


11 INTRODUCTION 


In everyday language, “to adapt” means to change a behavior to conform to 
new circumstances. Intuitively, an adaptive controller is thus a controller that 
can modify its behavior in response to changes in the dynamics of the process 
and the character of the disturbances. Since ordinary feedback also attempts 
to reduce the effects of disturbances and plant uncertainty, the question of the 
difference between feedback control and adaptive control immediately arises. 
Over the years there have been many attempts to define adaptive control 
formally. At an early symposium in 1961 a long discussion ended with the 
following suggestion: “An adaptive system is any physical system that has been 
designed with an adaptive viewpoint.” A renewed attempt was made by an 
IEEE committee in 1973. It proposed a new vocabulary based on notions like 
self-organizing control (SOC) system, parameter-adaptive SOC, performance- 
adaptive SOC, and learning control system. However, these efforts were not 
widely accepted. A meaningful definition of adaptive control, which would make 
it possible to look at a controller hardware and software and decide whether 
or not it is adaptive, 1s still lacking. However, there appears to be a consensus 
that a constant-gain feedback system is not an adaptive system. 

In this book we take the pragmatic attitude that an adaptive controller 
is a controller with adjustable parameters and a mechanism for adjusting 
the parameters. The controller becomes nonlinear because of the parameter 
adjustment mechanism. It has, however, a very special structure. Since general 
nonlinear systems are difficult to deal with, it makes sense to consider special 
classes of nonlinear systems. An adaptive control system can be thought of 
as having two loops. One loop is a normal feedback with the process and the 
controller. The other loop is the parameter adjustment loop. A block diagram 
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Parameter 
adjustment 


Controller 
parameters 


signal 


Figure 1.1 Block diagram of an adaptive system. 


Setpoint 
Output 


of an adaptive system is shown in Fig. 1.1. The parameter adjustment loop is 
often slower than the normal feedback loop. 

A control engineer should know about adaptive systems because they have 
useful properties, which can be profitably used to design control systems with 
improved performance and functionality. 


A Brief History 


In the early 1950s there was extensive research on adaptive control in connec- 
tion with the design of autopilots for high-performance aircraft (see Fig. 1.2). 
Such aircraft operate over a wide range of speeds and altitudes. It was found 
that ordinary constant-gain, linear feedback control could work well in one op- 
erating condition but not over the whole flight regime. A more sophisticated 
controller that could work well over a wide range of operating conditions was 
therefore needed. After a significant development effort it was found that gain 
scheduling was a suitable technique for flight control systems. The interest in 
adaptive control diminished partly because the adaptive control problem was 
too hard to deal with using the techniques that were available at the time. 

In the 1960s there were much research in control theory that contributed 
to the development of adaptive control. State space and stability theory were 
introduced. There were also important results in stochastic control theory. Dy- 
namic programming, introduced by Beliman, increased the understanding of 
adaptive processes. Fundamental contributions were also made by Tsypkin, 
who showed that many schemes for learning and adaptive control could be 
described in a common framework. There were also major developments in 
system identification. A renaissance of adaptive control occurred in the 1970s, 
when different estimation schemes were combined with various design meth- 
ods. Many applications were reported, but theoretical results were very limited. 

In the late 1970s and early 1980s, proofs for stability of adaptive systems 
appeared, albeit under very restrictive assumptions. The efforts to merge ideas 


12 Linear Feedback 8 


— ———— RAOR, 
TREA 


Figure 1.2 Several advanced flight control systems were tested on the X-15 
experimental aircraft. (By courtesy of NASA.) 


of robust control and system identification are of particular relevance. Inves- 
tigation of the necessity of those assumptions sparked new and interesting re- 
search into the robustness of adaptive control, as well as into controllers that 
are universally stabilizing. Research in the late 1980s and early 1990s gave 
new insights into the robustness of adaptive controllers. Investigations of non- 
linear systems led to significantly increased understanding of adaptive control. 
Lately, it has also been established that adaptive control has strong relations 
to ideas on learning that are emerging in the field of computer science. 

There have been many experiments on adaptive control in laboratories 
and industry. The rapid progress in microelectronics was a strong stimulation. 
Interaction between theory and experimentation resulted in a vigorous devel- 
opment of the field. As a result, adaptive controllers started to appear commer- 
cially in the early 1980s. This development is now accelerating. One result is 
that virtually all single-loop controllers that are commercially available today 
allow adaptive techniques of some form. The primary reason for introducing 
adaptive control was to obtain controllers that could adapt to changes in pro- 
cess dynamics and disturbance characteristics. It has been found that adaptive 
techniques can also be used to provide automatic tuning of controllers. 


1.2 LINEAR FEEDBACK 


Feedback by itself has the ability to cope with parameter changes. The search 
for ways to design a system that are insensitive to process variations was in 
fact one of the driving forces for inventing feedback. Therefore it is of interest 
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to know the extent to which process variations can be dealt with by using 
linear feedback. In this section we discuss how a linear controller can deal 
with variations in process dynamics. 


Robust High-Gain Control 


A linear feedback controller can be represented by the block diagram in Fig. 1.3. 
The feedback transfer function Gr is typically chosen so that disturbances 
acting on the process are attenuated and the closed-loop system is insensitive 
to process variations. The feedforward transfer function Gpr is then chosen 
to give the desired response to command signals. The system is called a two- 
degree-of-freedom system because the controller has two transfer functions that 
can be chosen independently. The fact that linear feedback can cope with 
significant variations in process dynamics can be seen from the following 
intuitive argument. Consider the system in Fig. 1.3. The transfer function from 
Ym to yis 
_ _GpGfb 
1+ G,G rp 


Taking derivatives with respect to Gp, we get 


aT _ 1 dG, 
T 14+6,Gp Gp 


The closed-loop transfer function T is thus insensitive to variations in the pro- 
cess transfer function for those frequencies at which the loop transfer function 


L = GpG fo (1.1) 


is large. To design a robust controller, it is thus attempted to find G pẹ such 
that the loop transfer function is large for those frequencies at which there are 
large variations in the process transfer function. For those frequencies where 
L(i@) = 1, however, it is necessary that the variations be moderate for the 
system to have sufficient robustness properties. 


Figure 1.3 Block diagram of a robust high-gain system. 
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Judging Criticality of Process Variations 


We now consider some specific examples to develop some intuition for judging 
the effects of parameter variations. The following example illustrates that 
significant variations in open-loop step responses may have little effect on the 
closed-loop performance. 


EXAMPLE11 Different open-loop responses 
Consider systems with the open-loop transfer functions 


— t _ 
TET 


where a = —0.01, 0, and 0.01. The dynamics of these processes are quite differ- 
ent, as is illustrated in Fig. 1.4{a). Notice that the responses are significantly 
different. The system with a = 0.01 is stable; the others are unstable. The 
initial parts of the step responses, however, are very similar for all systems. 
The closed-loop systems obtained by introducing the proportional feedback with 
unit gain, that is, u = ue — y, give the step responses shown in Fig. 1.4(b}. No- 
tice that the responses of the closed-loop systems are virtually identical. Some 
insight is obtained from the frequency responses. Bode diagrams for the open 
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Figure 1.4 (a) Open-loop unit step responses for the process in Example 1.1 
with a = —0.01, 0, and 0.01. (b) Closed-loop step responses for the same 
system, with the feedback u = u. — y. Notice the difference in time scales. 
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Figure 1.5 (a) Open-loop and (b) closed-loop Bode diagrams for the process 
in Example 1.1. 
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Figure L6 {a) Open-loop unit step responses for the process in Example 1.2 
with T = 0, 0.015, and 0.03. (b) Closed-loop step responses for the same 
system, with the feedback u = u, — y. Notice the difference in time scales. 


and closed loops are shown in Fig. 1.5. Notice that the Bode diagrams for the 
open-loop systems differ significantly at low frequencies but are virtually iden- 
tical for high frequencies. Intuitively, it thus appears that there is no problem 
in designing a controller that will work well for all systems, provided that the 
closed-loop bandwidth is chosen to be sufficiently high. This is also verified 
by the Bode diagrams for the closed-loop systems shown in Fig. 1.5(b), which 
are practically identical. Also compare the step responses of the closed-loop 
systems in Fig. 1.4(b). m 


The next example illustrates that process variations may be significant 
even if changes in the open-loop step responses are small. 


EXAMPLE 1.2 Similar open-loop responses 
Consider systems with the open-loop transfer functions 


400(1 — sT) 


Gols) = Gy ij(e +20) +75) 


with T = 0, 0.015, and 0.03. The open-loop step responses are shown in 
Fig. 1.6(a). Figure 1.6(b) shows the step responses for the closed-loop systems 
obtained with the feedback u = u, — y. Notice that the open-loop responses 
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Figure 1.7 Bode diagrams for the process in Example 1.2. (a) The open-loop 
system; (b) The closed-loop system. 
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are very similar but that the closed-loop responses differ considerably. The 
frequency responses give some insight. The Bode diagrams for the open- and 
closed-loop systems are shown in Fig. 1.7. Notice that the frequency responses of 
the open-loop systems are very close for low frequencies but differ considerably 
in the phase at high frequencies. It is thus possible to design a controller that 
works well for all systems provided that the closed-loop bandwidth is chosen to 
be sufficiently small. At the crossover frequency chosen in the example there 
are, however, significant variations that show up in the Bode diagrams of the 
closed-loop systems in Fig. 1.7(b} and in the step responses of the closed-loop 
system in Fig. 1.6(b). 0 


The examples discussed show that to judge the consequences of process 
variations from open-loop dynamics, it is better to use frequency responses than 
time responses. It is also necessary to have some information about the desired 
crossover frequency of the closed-loop system. Intuitively, it may be expected 
that a process variation that changes dynamics from unstable to stable is very 
severe, Example 1.1 shows that this is not necessarily the case. 


EXAMPLE13 Integrator with unknown sign 
Consider a process whose dynamics is described by 


Gols) = “2 (1.2) 


where the gain kp can assume both positive and negative values. This is a very 
severe variation because the phase of the system can change by 180°. This 
process cannot be controlled by a linear controller with a rational transfer 
function. This can be geen as follows. Let the controller transfer function be 
S(s)/R(s), where R(s} and S(s) are polynomials. Assume that deg R > deg S. 
The characteristic polynomial of the closed-loop system is then 


P(s) = sR(s) + k,S(s) 


Without lack of generality it can be assumed that the coefficient of the highest 
power of s in the polynomial R(s) is 1. The coefficient of the highest power 
of s of P(s) is thus also 1. The constant coefficient of polynomial k,S(s) is 
proportional to kp and can thus be either positive or negative. A necessary 
condition for P(s} to have all roots in the left half-plane is that all coefficients 
are positive. Since kp can be both positive and negative, the polynomial P(s) 
will always have a zero in the right half-plane for some value of kp. oO 


1.3 EFFECTS OF PROCESS VARIATIONS 


The standard approach to control system design is to develop a linear model 
for the process for some operating condition and to design a controller having 
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constant parameters. This approach has been remarkably successful. A funda- 
mental property is also that feedback systems are intrinsically insensitive to 
modeling errors and disturbances. In this section we illustrate some mecha- 
nisms that give rise to variations in process dynamics. We also show the effects 
of process variations on the performance of a control system. 

The examples are simplified to the extent that they do not create significant 
control problems but do illustrate some of the difficulties that might occur in 
real systems. 


Nonlinear Actuators 


A very common source of variations is that actuators, like valves, have a 
nonlinear characteristic. This may create difficulties, which are illustrated by 
the following example. 


' EXAMPLE 1.4 Nonlinear valve 


A simple feedback loop with a Proportional and Integrating (PI) controller, 
a nonlinear valve, and a process is shown in Fig. 1.8. Let the static valve 
characteristic be 


v=f(u)=ut u>O0 


Linearizing the system around a steady-state operating point shows that the 
incremental gain of the valve is f’(u), and hence the loop gain is proportional 
to f'(u}. The system can perform well at one operating level and poorly at 
another. This is illustrated by the step responses in Fig. 1.9. The controller is 
tuned to give a good response at low values of the operating level. For higer 
values of the operating level the closed-loop system even becomes unstable. 
One way to handle this type of problem is to feed the control signal u through 
an inverse of the nonlinearity of the valve. It is often sufficient to use a fairly 
crude approximation (see Example 9.1}. This can be interpreted as a special 
case of gain scheduling, which is treated in detail in Chapter 9. O 


FI controller Valve Process 


Figure 1.8 Block diagram of a flow control loop with a PI controller and a 
nonlinear valve. 
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Figure 1.9 Step responses for PI control of the simple flow loop in Ex- 
ample 1.4 at different operating levels. The parameters of the PI controller 
are K = 0.15, T; = 1. The process characteristics are f(u} = u‘ and 
Go(s) = 1/{s + 1}. 


Flow and Speed Variations 


Systems with flows through pipes and tanks are common in process control. 
The flows are often closely related to the production rate. Process dynamics 
thus change when the production rate changes, and a controller that is well 
tuned for one production rate will not necessarily work well for other rates. 
A simple example illustrates what may happen. 


EXAMPLE 1.5 Concentration control 


Consider concentration control for a fluid that flows through a pipe, with no 
mixing, and through a tank, with perfect mixing. A schematic diagram of the 
process is shown in Fig. 1.10. The concentration at the inlet of the pipe is cin. 
Let the pipe volume be Vz and let the tank volume be Vm. Furthermore, let the 
flow be g and let the concentration in the tank and at the outlet be c. A mass 
balance gives 

Va E = a(t) (cinlt—1) — cl) (1.3) 


where 
t = Va/q(t) 
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Figure 1.10 Schematic diagram of a concentration control system. 


Introduce 
T = V,,/qa(é) (1.4) 


For a fixed flow, that is, when g(t} is constant, the process has the transfer 
function 


Go(s) = 


ett 

1+sT (1.5) 
The dynamics are characterized by a time delay and first-order dynamics. The 
time constant T and the time delay 7 are inversely proportional to the flow gq. 

The closed-loop system is as in Fig. 1.8 with f(-} = 1 and Go(s) given 
by Eq. (1.5). A controller will first be designed for the nominal case, which 
corresponds to g = 1, T = 1, and t=1. A PI controller with gain K = 0.5 and 
integration time T; = 1.1 gives a closed-loop system with good performance 
in this case. Figure 1.11 shows the step responses of the closed-loop system 
for different flows and the corresponding control actions. The overshoot will 
increase with decreasing flows, and the system will become sluggish when the 
flow increases. For safe operation it is thus good practice to tune the controller 
at the lowest flow. Figure 1.11 shows that the system can easily cope with a 
flow change of +10% but that the performance deteriorates severely when the 
flow changes by a factor of 2. oO 


Variations in speed give rise to similar problems. This happens for example 
in rolling mills and paper machines. 


Flight Control 


The dynamics of an airplane change significantly with speed, altitude, angle 
of attack, and so on. Control systems such as autopilots and stability augmen- 
tation systems were used early. These systems were based on linear feedback 
with constant coefficients. This worked well when speeds and altitudes were 
low, but difficulties were encountered with increasing speed and altitude. The 
problems became very pronounced at supersonic flight. Flight control was one 
of the strong driving forces for the early development of adaptive control. The 
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Figure 1.11 Change in reference value for different flows for the system in 
Example 1.5. (a} Output e and reference c, concentration, (b} control signal. 


following example from Ackermann (1983) illustrates the variations in dynam- 
ics that can be encountered. The variations can be even larger for aircraft with 
larger variations in flight regimes. 


EXAMPLE 1.6 . Short-period aircraft dynamics 


A schematic diagram of an airplane is given in Fig. 1.12. To illustrate the 
effect of parameter variations, we consider the pitching motion of the aircraft. 
Introduce the pitch angle @. Choose normal acceleration N,, pitch rate q = @, 


Figure L12 Schematic diagram of the aircraft in Example 1.6. 
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and elevon angle 6, as state variables and the input to the elevon servo as the 
input signal u. The following model is obtained if we assume that the aircraft 


is a rigid body: 
dx Gu a g by 
dp = | G21 G22 Gas | xt 0O ju (1.6) 
0 0 —a a 


where x? = (N; @ ôe). This model is called short-period dynamics. The 
parameters of the model given depend on the operating conditions, which can 
be described in terms of Mach number and altitude; see Fig. 1.13, which shows 
the flight envelope. 

Table 1.1 shows the parameters for the four flight conditions (FC) indicated 
in Fig. 1.13. The data applies to the supersonic aircraft F4-E. The system has 
three eigenvalues. One eigenvalue, —a = —14, which is due to the elevon servo, 
is constant. The other eigenvalues, 4; and Az, depend on the flight conditions. 
Table 1.1 shows that the system is unstable for subsonic speeds (FC 1, 2, and 
3} and stable but poorly damped for the supersonic condition FC 4. Because of 
these variations it is not possible to use a controller with the same parameters 
for all flight conditions. The operating condition is determined from air data 
sensors that measure altitude and Mach number. The controller parameters are 
then changed as a function of these parameters. How this is done is discussed 
in Chapter 9. 

Much more complicated models will have to be considered in practice be- 
cause the airframe is elastic and will bend. Notch prefilters on the command 
signal from the pilot are also used so that the control actions will not excite 
the bending modes of the airplane. oO 


BO 


60 


40 


Altitude (x1000 ft} 


20 


Mach number 


Figure 113 Flight envelope of the F4-E. Four different flight conditions are 
indicated. (From Ackermann (1983), courtesy of Springer-Verlag.) 
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Table 1.1 Parameters of the airplane state model of Eq. (1.6) for different 
flight conditions (FC). 


FC 1 FC 2 FC 3 FC 4 
Mach 0.5 0.85 0.9 1.5 
Altitude (feet) 5000 5000 35000 _ 35000 

an 0.9896 —1.702 —0,667 —0.5162 

az 17.41 50.72 18.11 26.96 

Qis 96.15 263.5 84.34 178.9 

di 0.2648 0.2201 0.08201 —0.6896 

an —0.8512  —1.418 -0.6587  —1.225 

aa —11.39 —81,99 —10.81 —30.38 

b, —97.78  —2722 —85.09 —1758 

ay ~3.07 —4.90 —1.87 l 

de 1.23 1.78 0.56 SMERE 


Variations in Disturbance Characteristics 


So far, we have discussed effects of variations in process dynamics. There are 
also situations in which the key issue is variations in disturbance characteris- 
tics. Two examples follow. 


: EXAMPLE1.7 Ship steering 


A key problem in the design of an autopilot for ship steering is to compensate for 
the disturbing forces that act on the ship because of wind, waves, and current. 
The wave-generated forces are often the dominating forces. Waves have strong 
periodic components. The dominating wave frequency may change by a factor 
of 3 when the weather conditions change from light breeze to fresh gale. The 
frequency of the forces generated by the waves will change much more because 
it is also influenced by the velocity and heading of the ship. Examples of wave 
height and spectra for two weather conditions are shown in Fig. 1.14. It seems 
natural to take the nature of the wave disturbances into account in designing 
autopilots and roll dampers. Since the wave-induced forces change so much, it 
seems natural to adjust the controller parameters to cope with the disturbance 
characteristics. o 


Fositioning of ships and platforms is another example that is similar to 
ship steering. In this case the control system will typically have less control 
authority. This means that the platform to a greater extent has to “ride the 
waves” and can compensate only for a low-frequency component of the distur- 
bances. This makes it even more critical to have a model for the disturbance 
pattern. 
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Figure 1.14 Measurements and spectra of waves at different weather con- 
ditions at Hoburgen, Sweden. {a} Wind speed 3-4 m/s. (b} Wind speed 18-20 
m/s. (Courtesy of SSPA Maritime Consulting AB, Sweden.) 


In process control the key issue is often to perform accurate regulation. 
For important quality variables, even moderate reductions in the fluctuation 
of a quality variable can give substantial savings. If the disturbances have 
some statistical regularity, it is possible to obtain significant improvements in 
control quality by having a controller that is tuned to the particular character 
of the disturbance. Such controllers can give much better performance than 
standard PI controllers. The consequences of compensating for disturbances 
are illustrated by an example. 


EXAMPLE18 Regulation of a quality variable in process control 


Consider regulation of a quality variable of an industrial process in which 
there are disturbances whose characteristics are changing. A block diagram 
of the system is shown in Fig. 1.15. In the experiment it is assumed that the 
process dynamics are first order with time constant T = 1. It is assumed 
that the disturbance acts on the process input. The disturbance is simulated 
by sending white noise through a band-pass filter. The process dynamics are 
constant, but the frequency of the band-pass filter changes. Regulation can be 
done by a PI controller, but performance can be improved significantly by using 
a more complex controller that is tuned to the disturbance character. Such a 
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Figure 1.15 Block diagram of the system with disturbances used in Exam- 
ple 1.8. 


controller has a very high gain at the center frequency of the disturbance. 
Figure 1.16 shows the control error under different conditions. The center 
frequency of the band-pass filter used to generate the disturbance is œw, and the 
corresponding value used in the design of the controller is w,. In Fig. 1.16(a} we 
show the contro! error obtained when the controller is tuned to the disturbance, 


(a) Output error 
1 
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(b) Output error 
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(c) Output error 
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Figure 1.16 [Illustrates performance of controllers that are tuned to the 
disturbance characteristics. Output error when {a) œ = @, = 0.1; (b) @ = 
0.05, Oe = 0.1; (c) w = w, = 0.05. 
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that is, @ = wo = 0.1, In Fig. 1.16(b} we illustrate what happens when 
the disturbance properties change. Parameter @ is changed to 0.05, while 
@, = 0.1. The performance of the control system now deteriorates significantly. 
In Fig. 1.16(c} we show the improvement obtained by tuning the controller to 
the new conditions, that is, @ = w, = 0.05. g 


There are many other practical problems of a similar type in which there 
are significant variations in the disturbance characteristics. Having a con- 
troller that can adapt to changing disturbance patterns is particularly im- 
portant when there is limited control authority or dead time in the process 
dynamics. 


Summary 


The examples in this section illustrate some mechanisms that can create vari- 
ations in process dynamics. The examples have of necessity been very simple 
to show some of the difficulties that may occur. In some cases it is straightfor- 
ward to reduce the variations by introducing nonlinear compensations in the 
controllers. For the nonlinear valve in Example 1.4 it is natural to introduce 
a nonlinear compensator at the controller output that is the inverse of the 
valve characteristics. This modification is done in Example 9.1. The variations 
in flow rate in Example 1.5 can be dealt with in a similar way by measuring 
the flow and changing the controller parameters accordingly. To compensate 
for the variations in dynamics in Example 1.6, it is necessary to measure the 
flight conditions. In Examples 1.7 and 1.8, in which the variations are due to 
changes in the disturbances, it is not possible to directly relate the variation 
to a measurable quantity. In these cases it may be very advantageous to use 
adaptive control. 

In practice there are many different sources of variations, and there is usu- 
ally a mixture of different phenomena. The underlying reasons for the varia- 
tions are in most cases not fully understood. When the physics of the process 
is reasonably well known (as for airplanes), it is possible to determine suit- 
able controller parameters for different operating conditions by linearizing the 
models and using some method for control design. This is the common way to 
design autopilots for airplanes. System identification is an alternative to phys- 
ica] modeling. Both approaches do, however, require a significant engineering 
effort. 

Most industrial processes are very complex and not well understood; it is 
neither possible nor economical to make a thorough investigation of the causes 
of the process variations. Adaptive controllers can be a good alternative in such 
cases. In other situations, some of the dynamics may be well understood, but 
other parts are unknown. A typical example is robots, for which the geometry, 
motors, and gearboxes do not change but the load does change. In such cases 
it is of great importance to use the available a priori knowledge and estimate 
and adapt only to the unknown part of the process. 
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14 ADAPTIVE SCHEMES 


In this section we describe four types of adaptive systems: gain scheduling, 
model-reference adaptive control, self-tuning regulators, and dual control. 


Gain Scheduling 


In many cases it is possible to find measurable variables that correlate well 
with changes in process dynamics. A typical case is given in Example 1.4. These 
variables can then be used to change the controller parameters. This approach 
is called gain scheduling because the scheme was originally used to measure 
the gain and then change, that is, schedule, the controller to compensate for 
changes in the process gain. A block diagram of a system with gain scheduling 
is shown in Fig. 1.17. The system can be viewed as having two loops. There 
is an inner loop composed of the process and the controller and an outer loop 
that adjusts the controller parameters on the basis of the operating conditions. 
Gain scheduling can be regarded as a mapping from process parameters to 
controller parameters. It can be implemented as a function or a table lookup. 

The concept of gain scheduling originated in connection with the devel- 
opment of flight control systems. In this application the Mach number and 
the altitude are measured by air data sensors and used as scheduling vari- 
ables. This was used, for instance, in the X-15 in Fig. L2. In process control 
the production rate can often be chosen as a scheduling variable, since time 
constants and time delays are often inversely proportional to production rate. 
Gain scheduling is thus a very useful technique for reducing the effects of pa- 
rameter variations. Historically, it has been a matter of controversy whether 
gain scheduling should be considered an adaptive system or not. If we use the 
informa) definition in Section 1.1 that an adaptive system is a controller with 
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Output 


Figure 1.17 Block diagram of a system with gain scheduling. 
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adjustable parameters and an adjustment mechanism, it is clearly adaptive. 
An in-depth discussion of gain scheduling is given in Chapter 9. 


Model-Reference Adaptive Systems (MRAS) 


The model-reference adaptive system (MRAS) was originally proposed to solve 
a problem in which the performance specifications are given in terms of a 
reference model. This model tells how the process output ideally should respond 
to the command signal. A block diagram of the system is shown in Fig. 1.18. 
The controller can be thought of as consisting of two loops. The inner loop is an 
ordinary feedback loop composed of the process and the controller. The outer 
loop adjusts the controller parameters in such a way that the error, which is 
the difference between process output y and model output ym, is small. The 
MRAS was originally introduced for flight contro!. In this case the reference 
model describes the desired response of the aircraft to joystick motions. 

The key problem with MRAS is to determine the adjustment mechanism so 
that a stable system, which brings the error to zero, is obtained. This problem 
is nontrivial. The following parameter adjustment mechanism, called the MIT 
rule, was used in the original MRAS: 


(1.7) 


In this equation, e = y — ym denotes the model error and @ is a controller 
parameter. The quantity 0¢/0@ is the sensitivity derivative of the error with 
respect to parameter @. The parameter y determines the adaptation rate. 
In practice it is necessary to make approximations to obtain the sensitivity 
derivative. The MIT rule can be regarded as a gradient scheme to minimize 
the squared error e”. 


Adjustment 
mechanism 


Figure 1.18 Block diagram of a model-reference adaptive system (MRAS). 
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Self-tuning Regulators (STR) 


The adaptive schemes discussed so far are called direct methods, because the 
adjustment rules tell directly how the controller parameters should be updated. 
A different scheme is obtained if the estimates of the process parameters are 
updated and the controller parameters are obtained from the solution of a 
design problem using the estimated parameters. A block diagram of such a 
system is shown in Fig. 1.19. The adaptive controller can be thought of as 
being composed of two loops. The inner loop consists of the process and an 
ordinary feedback controller. The parameters of the controller are adjusted by 
the outer loop, which is composed of a recursive parameter estimator and a 
design calculation. It is sometimes not possible to estimate the process param- 
eters without introducing probing control signals or perturbations. Notice that 
the system may be viewed as an automation of process modeling and design, 
in which the process mode] and the control design are updated at each sam- 
pling period. A controller of this construction is called a self-tuning regulator 
(STR) to emphasize that the controller automatically tunes its parameters to 
obtain the desired properties of the closed-loop system. Self-tuning regulators 
are discussed in detail in Chapters 3 and 4. 

The block labeled “Controller design” in Fig. 1.19 represents an on-line 
solution to a design problem for a system with known parameters. This is 
the underlying design problem. Such a problem can be associated with most 
adaptive control schemes, but it is often given indirectly. To evaluate adaptive 
control schemes, it is often useful to find the underlying design problem, 
because it will give the characteristics of the system under the ideal conditions 
when the parameters are known exactly. 

The STR scheme is very flexible with respect to the choice of the under- 
lying design and estimation methods. Many different combinations have been 
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Figure 1.19 Block diagram of a self-tuning regulator (STR). 
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explored. The controller parameters are updated indirectly via the design cal- 
culations in the self-tuner shown in Fig. 1.19. It is sometimes possible to repa- 
rameterize the process so that the model can be expressed in terms of the 
controller parameters. This gives a significant simplification of the algorithm 
because the design calculations are eliminated. In terms of Fig. 1.19 the block 
labeled “Controller design” disappears, and the controller parameters are up- 
dated directly. 

In the STR the controller parameters or the process parameters are esti- 
mated in real time. The estimates are then used as if they are equal to the true 
parameters (i.e., the uncertainties of the estimates are not considered). This 
is called the certainty equivalence principle. In many estimation schemes it is 
also possible to get a measure of the quality of the estimates. This uncertainty 
may then be used in the design of the controller. For example, if there is a 
Jarge uncertainty, one may choose a conservative design. This is discussed in 
Chapter 7. 


Dual Control 


The schemes for adaptive control described so far look like reasonable heuristic 
approaches. Already from their description it appears that they have some 
limitations, For example, parameter uncertainties are not taken into account 
in the design of the controller. It is then natural to ask whether there are better 
approaches than the certainty equivalence scheme. We may also ask whether 
adaptive controllers can be obtained from some general principles. It is possible 
to obtain a solution that follows from an abstract problem formulation and use 
of optimization theory. The particular tool one could use is nonlinear stochastic 
control theory. This will lead to the notion of dual control. The approach will 
give a controller structure with interesting properties. A major consequence is 
that the uncertainties in the estimated parameters will be taken into account 
in the controller, The controller will also take special actions when it has poor 
knowledge about the process, The approach is so complicated, however, that so 
far it has not been possible to use it for practical problems. Since the ideas are 
conceptually useful, we will discuss them briefly in this section. 

The first problem that we are faced with is to describe mathematically the 
idea that a constant or slowly varying parameter is unknown. An unknown 
constant can be modeled by the differential equation 

do 

Pri 0 (1.8) 
with an initial distribution that reflects the parameter uncertainty. Parameter 
drift can be described by adding random variables to tbe right-hand side of 
Eq. (1.8). A model of a plant with uncertain parameters is thus obtained 
by augmenting the state variables of the plant and its environment by the 
parameter vector whose dynamics is given by Eq. (1.8). Notice that with this 
formulation there is no distinction between these parameters and the other 
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Figure 1.20 Block diagram of a dual controller. 


state variables, This means that the resulting controller can handle very rapid 
parameter variations. An augmented state z = (x? 8T )* consisting of the 
state of the process and the parameters can now be introduced. The goal of the 
control is then formulated to minimize a loss function 


T 
V=E G (2(T),u(T)) + f g(z, u) ae) 


where & denotes mathematical expectation, u is the control variable, and G 
and g are scalar functions of z and u. The expectation is taken with respect 
to the distribution of all initial values and all disturbances appearing in the 
models of the system. The criterion V should be minimized with respect to 
admissible controls that are such that u(t) is a function of past and present 
measurements and the prior distributions. The problem of finding a controller 
that minimizes the loss function is difficult. By making sufficient assumptions 
a solution can be obtained by using dynamic programming. The solution is then 
given in terms of a functional equation that is called the Bellman equation. 
This equation is an extension of the Hamilton-Jacobi equation in the calculus 
of variations. It is very difficult and time-consuming, if at all possible, to solve 
the Bellman equation numerically. 

Some structural properties are shown in Fig. 1.20. The controller can be 
regarded as being composed of two parts: a nonlinear estimator and a feedback 
controller. The estimator generates the conditional probability distribution of 
the state from the measurements, p(z|y,u). This distribution is called the hy- 
perstate of the problem. The feedback controller is a nonlinear function that 
maps the hyperstate into the space of control variables. This function could 
be computed off-line. The hyperstate must, however, be updated on-line. The 
structural simplicity of the solution is obtained at the price of introducing the 
hyperstate, which is a quantity of very high dimension. Updating of the hyper- 
state generally requires solution of a complicated nonlinear filtering problem. 
In simple cases the distribution can be characterized by its mean and covari- 
ance, as will be shown in Chapter 7. 

The optimal controller sometimes has some interesting properties, which 
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have been found by solving a number of specific problems. It attempts to 
drive the output to its desired value, but it will also introduce perturbations 
(probing) when the parameters are uncertain. This improves the quality of the 
estimates and the future performance of the closed-loop system. The optimal 
control gives the correct balance between maintaining good control and small 
estimation errors. The name dual control was coined to express this property. 

It is interesting to compare the controller in Fig. 1.20 with the self-tuning 
regulator in Fig. 1.19. In the STR the states are separated into two groups: 
the ordinary state variables of the underlying constant parameter model] and 
the parameters, which are assumed to vary slowly. The parameter estimator 
may be considered as an observer for the parameters. Notice that many esti- 
mators will also provide estimates of the uncertainties, although this is not 
used in calculating the control signal. The calculation of the hyperstate in the 
dual controller gives the conditional distribution of all states and all parame- 
ters of the process. The conditional mean value represents estimates, and the 
conditional covariances give the uncertainties of the estimates. Uncertainties 
are not used in computing the control signal in the self-tuning regulator. They 
are important for the dual controller because it may automatically mtroduce 
perturbations when the estimates are poor. Dual control is discussed in more 
detail in Chapter 7. 


1.5 THE ADAPTIVE CONTROL PROBLEM 


In this section we formulate the adaptive control problem. We do this by 
giving examples of process models, controller structures, and ways to adapt 
the controller parameters. 


Process Descriptions 


In this book the processes will mainly be described by linear single-input, 
single-output systems. In continuous time the process can be in state space 
form: 


dx 
y= Cx 
or in transfer function form: 
m --1 ate 
G(s) = B(S) _ bos + bis” + bn «110 


A(s) tas +..-a, 
where s is the Laplace transform variable. Notice that A, B, and C are used 
for matrices as well as polynomials. In normal cases this will not cause any 


misunderstanding. In ambiguous cases the argument will be used in the poly- 
nomials. 
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In discrete time the process can be described in state space form: 
x(t + 1) = bx(t) + Tu(é) 
y(t) = Cx(t) 


where the sampling interval is taken as the time unit. The discrete time system 
can also be represented by the pulse transfer function 


B(z) _ boz” + bz") +--+ bp 
A(z) Z +a! an 


H,(2) = (1.11) 
where z is the z-transform variable. 

The parameters, bo, bi,...,bm,@1,.++,4n of systems (1.10) and (1.11) as well 
as the orders m,n are often assumed to be unknown or partly unknown. 


A Remark on Notation 


Throughout this book we need a convenient notation for the time functions 
obtained in passing signals through linear systems, For this purpose we will 
use the differential operator p = d/dt. The output of the system with the 
transfer function G(s) when the input signal is u(t) will then be denoted by 


y(t) = G(p)u(t) 


The output will also depend on the initial conditions. In using the above 
notation it is assumed that all initial conditions are zero. To deal with discrete 
time systems, we introduce the forward shift operator q defined by 


qy(t) = y(t + 1) 
The output of a system with input u and pulse transfer function H(z) is denoted 


by 
y(t) = H(q}u(t) 
In this case it is also assumed that all initial conditions are zero. 


Controller Structures 


The process is controlled by a controller that has adjustable parameters. It is 
assumed that there exists some kind of design procedure that makes it possible 
to determine a controller that satisfies some design criteria if the process and 
its environment are known. This is called the underlying design problem. The 
adaptive control problem is then to find a method of adjusting the controller 
when the characteristics of the process and its environment are unknown or 
changing. In direct adaptive control the controller parameters are changed 
directly without the characteristics of the process and its disturbances first 
being determined, In indirect adaptive methods the process model and possibly 
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the disturbance characteristics are first determined. The controller parameters 
are designed on the basis of this information. 

One key problem is the parameterization of the controller. A few examples 
are given to illustrate this. 


. EXAMPLE1.9 Adjustment of gains in a state feedback 

Consider a single-input, single-output process described by Eq. (1.9). Assume 

that the order n of the process is known and that the controller is described by 
u = =Lx 


In this case the controller is parameterized by the elements of the matrix L. 
o 


EXAMPLE 1.10 A general linear controller 
A general linear controller can be described by 


R(s)U(s) = —S(s)¥(s) + T(s)U,(s) 


where R, S, and T are polynomials and U, Y, and U, are the Laplace transform 
of the control signal, the process output, and the reference value, respectively. 
Several design methods are available to determine the parameters in the con- 
troller when the system is known. D 


In Examples 1.9 and 1.10 the controller is linear. Of course, parameters can 
also be adjusted in nonlinear controllers. A common example is given next. 


_EXAMPLE1.11 Adjustment of a friction compensator 


Friction is common in all mechanical systems. Consider a simple servo drive. 
Friction can to some extent be compensated for by adding the signal uy, to a 
controller, where 


—u ifv<0 


where v is the velocity, The signal attempts to compensate for Coulomb fric- 
tion by adding a positive control signal u, when the velocity is positive and 
subtracting u_ when the velocity is negative. The reason for having two param- 
eters is that the friction forces are typically not symmetrical. Since there are 
so many factors that influence friction, it is natural to try to find a mechanism 
that can adjust the parameters u, and u_ automatically. oO 


ifv>0O 
un={ Ht 


The Adaptive Control Problem 


An adaptive controller has been defined as a controller with adjustable param- 
eters and a mechanism for adjusting the parameters. The construction of an 
adaptive controller thus contains the following steps: 
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» Characterize the desired behavior of the closed-loop system. 
e Determine a suitable control law with adjustable parameters. 
e Find a mechanism for adjusting the parameters. 
e Implement the control law. 
In this book, different ways to derive the adjustment rule will be discussed. 


16 APPLICATIONS 


There have been a number of applications of adaptive feedback control since 
the mid-1950s. The early experiments, which used analog implementations, 
were plagued by hardware problems. Systems implemented by using minicom- 
puters appeared in the early 1970s. The number of applications has increased 
drastically with the advent of the microprocessor, which has made the tech- 
nology cost-effective. Adaptive techniques have been used in regular industrial 
controllers since the early 1980s. Today, a large number of industrial control 
loops are under adaptive control. These include a wide range of applications in 
aerospace, process control, ship steering, robotics, and automotive and biomed- 
ical systems. The applications have shown that there are many cases in which 
adaptive control is very useful, others in which the benefits are marginal, and 
yet others in which it is inappropriate. On the basis of the products and their 
uses, it is clear that adaptive techniques can be used in many different ways. 
In this section we give a brief discussion of some applications. More details are 
given in Chapter 12. 


Automatic Tuning 


The most widespread applications are in automatic tuning of controllers. By 
automatic tuning we mean that the parameters of a standard controller, for in- 
stance a PID controller, are tuned automatically at the demand of the operator. 
After the tuning, the parameters are kept constant. Practically all controllers 
can benefit from tools for automatic tuning. This will drastically simplify the 
use of controllers. Practically all adaptive techniques can be used for auto- 
matic tuning. There are also many special techniques that can be used for this 
purpose. Single-loop controllers and distributed systems for process control are 
important application areas. Most of these controllers are of the PID type. This 
is a vast application area because there are millions of controllers of this type 
in use. Many of them are poorly tuned. 

Although automatic tuning is currently widely used in simple controllers, 
it is also beneficial for more complicated controllers. It is in fact a prerequisite 
for the widespread use of more advanced control algorithms. A mechanism for 
automatic tuning is often necessary to get the correct time scale and to find 
a starting value for a more complex adaptive controller. The main advantage 
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Figure 1.21 Gain scheduling is an important ingredient in modern flight 
control systems. 


of using an automatic tuner is that it simplifies tuning drastically and thus 
contributes to improved control quality. Tuners have also been developed for 
other standard applications such as motor control. This is also a case in which 
a fairly standardized system has to be applied to a wide variety of applications. 


Gain Scheduling 


Gain scheduling is a powerful technique that is straightforward and easy to 
use. The key problem is to find suitable scheduling variables, that is, variables 
that characterize the operating conditions {see Fig. 1.17). Typical choices are 
filtered versions of process input, process output or external variables. It may 
also be a significant engineering effort to determine the schedules. This effort 
can be reduced significantly by using automatic tuning because the schedules 
can then be determined experimentally. Auto-tuning or adaptive algorithms 
may be used to build gain schedules. A scheduling variable is first determined. 
Its range is quantized into a number of discrete operating conditions. The 
controller parameters are determined by automatic tuning when the system is 
running in one operating condition. The parameter values are stored in a table. 
The procedure is repeated until all operating conditions are covered. In this 
way it is easy to install and tune gain scheduling into a computer-controlled 
system. The only facility required is a table for storing and recalling controller 
parameters. 

Gain scheduling is the standard technique used in flight control systems 
for high-performance aircrafts. An example is given in Fig. 1.21, In this case 
the scheduling variables are Mach number and height. A massive engineering 
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effort is required to develop such systems. Gain scheduling is increasingly being 
used for industrial process control. It is easy to implement using standard 
systems for distributed control. A combination with automatic tuning makes 
it possible to significantly reduce the engineering effort in developing the 
systems such a procedure cannot be used for flight control because of the severe 
requirement on verification. 


Continuous Adaptation 


There are several cases in which the process or the disturbance characteristics 
are changing continuously. Continuous adaptation of controller parameters is 
then needed. The MRAS and the STR are the most common approaches for 
parameter adjustment. There are many different ways to use the techniques. 
In some cases, it 1s natural to assume that the process is described by a gen- 
eral linear model. In other cases, parts of the model are known and only a few 
parameters are adjusted. In many situations it is possible to measure the dis- 
turbances acting on a system. A typical example is climate control in houses in 
which the outdoor temperature can be measured. The process of using the mea- 
surable disturbance and compensating for its influence is called feedforward. 
Adaptation of feedforward compensators has been found particularly beneficial. 
One reason for this is that feedforward control requires good models. Another 
is that it is difficult and time consuming to tune feedforward loops because 
it is necessary to wait for a proper disturbance to appear. Adaptation is thus 
almost a prerequisite for using feedforward control. 

Since adaptive control is a relatively new technology, there is limited ex- 
perience of its use in products. One observation that has been made is that 
the human-machine interface is very important. Adaptive controllers also have 
their own parameters, which must be chosen. It has been our experience that 
controllers without any externally adjusted parameters can be designed for 
specific applications in which the purpose of control can be stated a priori, Au- 
topilots for missiles and ships are typical examples. However, in many cases it 
is not possible to specify the purpose of control a priori. It is at least necessary 
to tell the controller what it is expected to do. This can be done by introducing 
dials that give the desired properties of the closed-loop system. Such dials are 
performance-related. New types of controllers can be designed by using this 
concept. For example, it is possible to have a controller with one dial, labeled 
with the desired closed-loop bandwidth. This is very convenient for applica- 
tions to motor control. Another possibility would be to have a controller with 
a dial labeled with the weighting between state deviation and control action 
in a quadratic optimization problem. Adaptation can also be combined with 
gain scheduling. A gain schedule can be used to get the parameters quickly 
into the correct region, and adaptation can then be used for fine-tuning, On 
the whole it appears that there is significant room for engineering ingenuity 
in the packaging of adaptive techniques. 
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Abuses of Adaptive Control 


An adaptive controller, being inherently nonlinear, is more complicated than 
a fixed-gain controller. Before attempting to use adaptive control, it is there- 
fore important to investigate whether the control problem might be solved by 
constant-gain feedback. In the literature on adaptive control there are many 
cases in which constant-gain feedback can do as well as an adaptive controller. 
This is one reason why we are discussing alternatives to adaptive control in 
this book. One way to proceed in deciding whether adaptive control should be 
used is sketched in Fig. 122. 


Varying Constant 


Use a controller with Use a controller with 
varying parameters constant parameters 
Unpredictable Predictable 

variations variations 


Use an adaptive 
controller 


Figure 1.22 Procedure to decide what type of controller to use. 


Use gain scheduling 


Industrial Products 


The industria) products can, broadly speaking, be divided into three different 
categories: standard controllers, distributed control systems, and dedicated 
special-purpose systems. 

Standard controllers form the largest category. They are typically based on 
some version of the PID algorithm. Currently, there is very vigorous develop- 
ment of these systems, which are manufactured in large quantities. Practically 
all new single-loop controllers introduced use some form of adaptation. Many 
different schemes are used. The single-loop controller is in fact becoming a 
proving ground for adaptive control. One example is shown in Fig. 1.28. This 
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Figure 1.23 Two generations of commercial PID controllers with automatic 
tuning, gain scheduling, and feedforward. The left figure shows the ECA40 
from SattControl Instruments which was the first stand alone controller with 
relay auto-tuning. The right figures shows a more recent version from ABB. 
Tuning is performed on operator demand when the tune button is pushed. 


system has automatic tuning of the PID controller. The controller also has feed- 
forward and gain scheduling. The automatic tuning is implemented in such a 
way that the user only has to push a button to execute the tuning. 

A standard controller may be regarded as automation of the actions of 
a process operator. The controller shown in Fig. 1.23 may be viewed as the 
next level of automation, in which the actions of an instrument engineer are 
automated. 

Distributed control systems are general-purpose systems primarily for pro- 
cess contro] applications. These systems may be viewed as a toolbox for im- 
plementing a wide variety of control systems. In addition to tools for PID con- 
trol, alarm, and startup, more advanced control schemes are also incorporated. 
Adaptive techniques are now being introduced in the distributed systems, al- 
though the rate of development is not as rapid as for single-loop controllers. 
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There are many special-purpose systems for adaptive control. The applica- 
tions range from space vehicles to automobiles and consumer electronics. The 
spacecraft Gemini, for example, has an adaptive notch filter and adaptive fric- 
tion compensation. The following is another example of an adaptive controller. 


| EXAMPLE 1.12 An adaptive autopilot for ship steering 


This is an example of a dedicated system for a special application. The adap- 
tive autopilot is superior to a conventional autopilot for two reasons: It gives 
better performance, and it is easter to operate. A conventional autopilot has 
three dials, which have to be adjusted over a continuous scale. The adaptive 
autopilot has a performance-related switch with two positions (tight steering 
and economic propulsion). In the tight steering mode the autopilot gives good, 
fast response to commands with no consideration for propulsion efficiency. In 
the economic propulsion mode the autopilot attempts to minimize the steering 
loss. The control performance is significantly better than that of a well-adjusted 
conventional autopilot, as shown in Fig. 1.24. The figure shows heading devia- 
tions and rudder motions for an adaptive autopilot and a conventional autopilot. 
The experiments were performed under the same weather conditions. Notice 
that the heading deviations for the adaptive autopilot are much smaller than 
those for the conventional autopilot but that the rudder motions are of the same 
magnitude. The adaptive autopilot is better because it uses a more complicated 
control law, which has eight parameters instead of three for the conventional 
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Figure 1.24 The figure shows the variations in heading and the correspond- 
ing rudder motions of a ship. (a) Adaptive autopilot. (b) Conventional autopi- 
lot based on a PID-like algorithm. 
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autopilot. For example, the adaptive autopilot has an internal model of the 
wave motion. If the adaptation mechanism is switched off, the constant pa- 
rameter controller obtained will perform well for a while, but its performance 
will deteriorate as the conditions change. Since it is virtually impossible to 
adjust eight parameters manually, adaptation is a necessity for using such a 
controller. The adaptive autopilot is discussed in more detail in Chapter 12. O 


The next example illustrates a general-purpose adaptive system. 


EXAMPLE 1.13 Novatune 


The first general-purpose adaptive system was Novatune, announced by the 
Swedish company Asea in 1982. The system can be regarded as a software- 
configured toolbox for solving control problems. It broke with conventional 
process control by using a general-purpose discrete-time pulse transfer function 
as the building block. The system also has elements for conventional PI and 
PID control, lead-lag filter, logic, sequencing, and three modules for adaptive 
control. It has been used to implement control systems for a wide range of 
process control problems. The advantage of the system is that the control 
system designer has a simple means of introducing adaptation. The adaptive 
controller is now incorporated in ABB Master (see Chapter 12). o 


1.7 CONCLUSIONS 


The purpose of this chapter has been to introduce the notion of adaptive control, 
to describe some adaptive systems, and to indicate why adaptation is useful. 
An adaptive controller was defined as a controller with adjustable parameters 
and a mechanism for adjusting the parameters. 

The key new element is the parameter adjustment mechanisms. Five ways 
of doing this were discussed: gain scheduling, auto tuning, model-reference 
adaptive control, self-tuning control, and dual control. To present a balanced 
account and to give the knowledge required to make complete systems, all 
aspects of the adaptive problem will be discussed in the book. 

Some reasons for using adaptive control have also been discussed in this 
chapter. The key factors are 


a yariations in process dynamics, 
e variations in the character of the disturbances, and 


e engineering efficiency and ease of use. 


Examples of mechanisms that cause variations in process dynamics have been 
given. The examples are simplistic; in many real-life problems it is difficult 
to describe the mechanisms analytically. Variations in the character of distur- 
bances is another strong reason for using adaptation. 
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Adaptive control is not the only way to deal with parameter variations. 
Robust control is an alternative. A robust controller is a controller that can 
satisfactorily control a class of system with specified uncertainties in the pro- 
cess model. To have a balanced view of adaptive techniques, it is therefore 
necessary to know these methods as well {see Chapter 10}. Notice particu- 
larly that there are few alternatives to adaptation for feedforward control of 
processes with varying dynamics. 

Engineering efficiency is an often overlooked argument in the choice be- 
tween different techniques. It may be advantageous to trade engineering ef- 
forts against more “intelligence” in the controller. This tradeoff is one reason 
for the success of automatic tuning. When a control loop can be tuned simply 
by pushing a button, it is easy to commission control systems and to keep them 
running well. This also makes it possible to use a more complex controller like 
feedforward. With toolboxes for adaptive control (such as ABB Master) it is 
often a simple matter to configure an adaptive control system and to try it ex- 
perimentally. This can be much less time-consuming than the alternative path 
of modeling, design, and implementation of a conventional control system, The 
knowledge required te build and use toolboxes for adaptive control is given in 
the chapters that follow. It should be emphasized that typical industrial pro- 
cesses are so complex that the parameter variations cannot be determined from 
first principles. 

A more complex controller may be used on different processes, and the de- 
velopment expenses can be shared by many applications. However, it should be 
pointed out that the use of an adaptive controller will not replace good process 
knowledge, which is still needed to choose the specifications, the structure of 
the controller, and the design method. 


PROBLEMS 


1.1 Look up the definitions of “adaptive” and “learning” in a good dictionary. 
Compare the uses of the words in different fields. 


1.2 Find descriptions of adaptive controllers from some manufacturers and 
browse through them. 


1.3 Give some situations in which adaptive control may be useful. What 
factors would you consider when judging the need for adaptive control? 


1.4 Make an assessment of the field of adaptive control by making a literature 
search. Look for the distribution of publications on adaptive control over 
the years. Can you see some pattern in the publications concerning uses 
of different methods, emphasis on theory and applications, and so on? 


1.5 


1.6 


1.7 


1.8 
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The system in Example 1.4 has the following characteristics: 


_ 1 
(s+ 1}? 

fu) =u" 
The PI controller has the gain K = 0.15 and the reset time T; = 1. 
Linearize the equations when the reference values are u, = 0.8, 1.1, and 
5.1, Determine the roots of the characteristic equation in the different 


cases, Determine a reference value such that the lmearized equations 
just become unstable. 


Go(s) = 


Consider the concentration control system in Example 1.5. Assume that 
Va = Vin = 1 and that the nominal flow is g = 1. Determine PI controllers 
with the transfer function 
1 
€ 1 mm 
K ( + 7) 


that give good closed-loop performance for the flows q = 0.5, 1, and 2. 
Test the controllers for the nominal flow. 


Consider the following system with two inputs and two outputs: 


a (7 ° ° 1 0 

=| 0-8 ofz+j|o 2]u 
0 0 -1 0 1 

{1109 

Zlio 1l” 


Assume that proportional feedback is introduced around the second loop: 


Ho = —kaya 
(a} Determine the transfer function from u to y1, and determine how the 
steady-state gain depends on kg. 


(b) Simulate the response of yı and yo when u is a step for different 
values of ko. 


A block diagram of a system used for metal cutting on a numerically 
controlled machine is shown in Fig. 1.25. The machine is equipped with 
a force sensor, which measures the cutting force. A controller adjusts 
the feedback to maintain a constant cutting force. The cutting force is 
approximately given by 
U a 
P=ra(k) 


N 


where a is the depth of the cut, v is the feed rate, N is the spindle speed, 
æ is a parameter in the range 0.5 < a < 1, and k is a positive parameter. 
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Figure 1.25 Block diagram of a control system for metal cutting. 


The steady-state gain from feed rate to force is 
K=kaav"'Nn* 


The gain increases with increasing depth a, decreasing feed rate v, and 

decreasing spindle speed N. Assume that œ = 0.7, k = 1, a = 1, ¢ = 0.7, 

and @ = 5. Determine T; such that the closed-loop system shows good 

closed-loop behavior for N = 1 and a = 1. 

(a) Investigate the performance of the closed-loop system when N varies 
between 0.2 and 2 and a = 1. 

(b) Repeat part (a) but for a varying between 0.5 and 4 and N = 1. 


Consider the system in Fig. 1.26. Let the process be 


Gols) ~ s+a 
where 
K = Ko + AK Ko = 
a = üo + Aa a9 = 


Figure 1,26 Block diagram for Problems 1.9 and 1.10. 
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and 
—O.5< AK < 2.0 
— 2.0 < Aa < 2.0 
Let the ideal closed-loop response be given by 
1 
Yn(s) = 41 U,(s) 


(a) Simulate the open-loop responses for some values of K and a. 

(b) Determine a controller for the nominal system such that the difference 
between step responses of the closed-loop system and of the desired 
system is less than 1% of the magnitude of the step. 

(c) Use the controller from part (b) and investigate the sensitivity to 
parameter changes. 

(d) Use the controller from part (b) and investigate the sensitivity to the 
disturbance d(t) when 


-1 O0<t<6 
d(t)={ 2 6<t<15 
1 16b<¢ 


(e) Use the controller from part (b) and investigate the influence of 
measurement noise, e(t). Let e(t} be zero mean white noise. 


This problem and the next example are based on a special session at 
the 1988 American Control Conference in Atlanta, Georgia. A detailed 
discussion of the problem is found in International Journal of Adaptive 
Control and Signal Processing, No. 2, June 1989, which is entirely devoted 
to the problem. 


Make the same investigation as in Problem 1.9 when the process is 
K 
Gols) = s+ astada 

where 

K = K +AK Kọo=1 

a, = io + ^a & = 1.4 

ag = deg + Aas don = 1 
and 

—05< AK < 2.0 


— 2.0 < Aa, < 2.0 
— 3.0 < Aa < 3.0 
Let the desired closed-loop response be given by 


1 


¥m(s) = s?4145+1 


U.{s) 
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CHAPTER 2 


REAL-TIME 
PARAMETER ESTIMATION 


2.1 INTRODUCTION 


On-line determination of process parameters is a key element in adaptive 
control. A recursive parameter estimator appears explicitly as a component 
of a self-tuning regulator (see Fig. 1.19). Parameter estimation also occurs 
implicitly in a model-reference adaptive controller (see Fig. 1.18). This chapter 
presents some methods for real-time parameter estimation. It is useful to view 
parameter estimation in the broader context of system identification. The key 
elements of system identification are selection of model structure, experiment 
design, parameter estimation, and validation. Since system identification is 
executed automatically in adaptive systems, it is essential to have a good 
understanding of all aspects of the problem. Selection of model structure and 
parameterization are fundamental issues. Simple transfer function models will 
be used in this chapter. The identification problems are simplified significantly 
if the models are linear in the parameters. 

The experiment design is crucial for successful system identification. In 
control problems this boils down to selection of the input signal. Choosing an 
input signal requires some knowledge of the process and the intended use of 
the model. In adaptive systems there is an additional complication because the 
input signal to the plant is generated by feedback. In certain cases this does 
not permit the parameters to be determined uniquely, a situation that has 
far-reaching consequences. In some cases it may be necessary to introduce 
perturbation signals, as discussed in more detail in Chapters 6 and 7. In 
adaptive control the parameters of a process change continuously, so it is 
necessary to have estimation methods that update the parameters recursively. 
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In solving identification problems it is very important to validate the results. 
This is especially important for adaptive systems, in which identification is 
done automatically. Some validation techniques will therefore be discussed. 

The least-squares method is a basic technique for parameter estimation. 
The method is particularly simple if the model has the property of being linear 
in the parameters. In this case the least-squares estimate can be calculated 
analytically. A compact presentation of the method of least squares is given 
in Section 2.2. The formulas for the estimate are derived, and geometric and 
statistical interpretations are given. It is shown how the computations can be 
done recursively. In Section 2.3 it is shown how the least-squares method can 
be used to estimate parameters in dynamical systems, Experimenta] conditions 
are discussed in Section 2.4, In particular we introduce the notion of persistent 
excitation. In using parameter estimation in adaptive control it is useful to 
have an intuitive insight into the properties of parameter estimators. To start 
to develop this, we give a number of simulations that illustrate the properties of 
the different algorithms in Section 2.5. More properties of different estimation 
schemes are given in Chapter 6 in connection with convergence and stability 
analysis of adaptive controllers. 


2.2 LEAST SQUARES AND REGRESSION MODELS 


Karl Friedrich Gauss formulated the principle of least squares at the end of the 
eighteenth century and used it to determine the orbits of planets and asteroids. 
Gauss stated that, according to this principle, the unknown parameters of a 
mathematical model should be chosen in such a way that 


the sum of the squares of the differences between the actually observed 
and the computed values, multiplied by numbers that measure the 
degree of precision, is a minimum. 
The least-squares method can be applied to a large variety of problems. It is 
particularly simple for a mathematical model that can be written in the form 


y(i) = pr (Of + Gali) OG +--+ On(OR = 97 (I) (2.1) 


where y is the cbserved variable, A, OS yess 6° are parameters of the model to 
be determined, and @1, 2, ..., 9, are known functions that may depend on 
other known variables. The vectors 


P (i) = (ili) Pali)... Pri) 
e=(e9 of ... ao)? 
have also been introduced. The model is indexed by the variable i, which often 
denotes time. It will be assumed initially that the index set is a discrete set. 


The variables g; are called the regression variables or the regressors, and the 
model in Eq. (2.1) is also called a regression model. Pairs of observations 
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and regressors {(y(i), p(i)), i= 1,2, ..., £} are obtained from an experiment. 
The problem is to determine the parameters in such a way that the outputs 
computed from the model in Eq. (2.1) agree as closely as possible with the 
measured variables y(i) in the sense of least squares. That is, the parameter 
@ should be chosen to minimize the least-squares loss function 


ve= 5 yO-9" (He)? (2.2) 


Since the measured variable y is linear in parameters 6° and the least-squares 
criterion is quadratic, the problem admits an analytical solution. Introduce the 
notations 

¥(t)=(9(1) 92) -s 

E(t)=(e(1) e€(2) ... eA) 


g* (1) 
e=| : 
p” (t) 
4 Ct 
P= (e720) = (Lower) (2.3) 


where the residuals e(i) are defined by 
efi) = y(i) — Hi) = y(i) — 9" (HE 
With these notations the loss function (2.2) can be written as 


V (6,2) -32 20- == SETE = > = JE 


where £ can be written as 
E-Y-Y=Y-e (2.4) 
The solution to the least-squares problem is given by the following theorem. 


THEOREM 2.1 Least-squares estimation 
The function of Eq. (2.2) is minimal for parameters Ê such that 


o'o§=o7Y (2.5) 
If the matrix 7 is nonsingular, the minimum is unique and given by 
= (@?@) oT y (2.6) 
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Proof: The loss function of Eq. (2.2) can be written as 
2V (0,t) = ETE = (Y — v6)" (Y — 08) 
= YTY —-YTpo -oT oy + 67 a w@ (2.7) 
Since the matrix @' is always nonnegative definite, the function V has a 
minimum. The loss function is quadratic in 8. The minimum can be found in 
many ways. One way is to determine the gradient of Eq. (2.7) with respect 
to @. (See Problem 2.1 at the end of the chapter). The gradient is zero when 


Eq. (2.5) is satisfied. Another way to find the minimum is by completing the 
square. We get 


2V(e,t)} = YTY —Y* oo — 0? oY + 6 oo 
+ Y7&(b? @) TY — Y" @(6"o) "TY 
= ¥7(1- 0(@70)'@7)y 


+ (9—(&7)@TY)"67@(6— (7b) G7Y) (2.8) 


The first term on the right-hand side is independent of @. The second term is 
always positive. The minimum is obtained for 


0 == (b79) oy 
and the theorem is proven. m 


Remark I. Equation (2.5) is called the normal equation. Equation (2.6) can 
be written as 


êle) = (È ewe") (x: PO) = PO (£ DO) C9 


Remark 2. The condition that the matrix ©’ is invertible is called an exci- 
tation condition. 


Remark 3. The least-squares criterion weights all errors £(i} equally, and this 
corresponds to the assumption that all measurements have the same precision. 
o 


Different weighting of the errors can be accounted for by changing the loss 
function (2.2) to 


V= 5 EWE (2.10) 


where W is a diagonal matrix with the weights in the diagonal. The least- 
squares estimate is then given by 


6 = (@TWo) oe WY (2.11) 
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EXAMPLE 2.1 Least-squares estimation of static system 
Consider the system 


y(i) = bo + byu(i) + bow? (i) + efi) 


where e(i) is zero mean Gaussian noise with standard deviation 0.1. The system 
is linear in the parameters and can be written in the form (2.1) with 


gy =(1 u(i) u?(i)) 
a? = (bo bi be) 


The output is measured for the seven different inputs shown by the dots in 
Fig. 2.1. In practice the model structure is usually unknown, and the user must 
decide on an appropriate model. We illustrate this by estimating parameters 
of the following models: 


Model 1: y(i) = bo 

Model 2: y(i) = bo + byu 

Model 3: y(i) = bo + biu + bzu? 

Model 4: y(i) = bo + biu + bgu? + bau? 


The different models give a polynomial dependence of different orders between. 
y and u. 

Table 2.1 shows the least-squares estimates of the different models together 
with the resulting loss function. Figure 2.1 also shows the estimated relation 
between u and y for the different models. From the table it is seen that about 
the same losses are obtained for Models 3 and 4. The fit to the data points is 
almost the same for these two models, as is seen in Fig. 2.1. o 


The example shows that it is important to choose the correct model struc- 
ture to get a good model. With few parameters it is not possible to get a good 
fit to the data. If too many parameters are used, the fit to the measured data 
will be very good but the fit to another data set may be very poor. This latter 
situation is called overfttting. 


Table 2.1 Least-squares estimates and loss functions for the system in 
Example 2.1 using different model structures. 


Model bo by b, Bs V 
1 3.85 34.46 
2 0.57 1.09 1.01 
3 1.11 0.45 0.11 0.031 
4 1.13 0.37 0.14 —0.003 0.027 
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(a) 8 


o> 


4 2 
Input Input 


0 2 4 6 
Input Input 


Figure 2.1 The dots represent the measured data points. Resulting models, 
indicated by the solid lines, based on the least-squares estimates are also 
given for (a) Model 1, (b) Model 2, (c) Model 3, (d) Model 4. 


Geometric Interpretation 


The least-squares problem can be interpreted as a geometric problem in F’, 
where ¢ is the number of observations. Figure 2.2 illustrates the situation with 
two parameters and three observations, The vectors p and g* spans a plane if 
they are linearly independent. The predicted output Ÿ lies in the plan spanned 
by g! and g?. The error £ = Y — Y is smallest when £ is orthogonal to this 
plane. In the general case, Eq. (2.4} can be written as 


e(1) y(1) ı(1) Pn(1) 
e(2) y2) [e] ” pı(2) | Pn(2) 
. |= 1-0 = . n 
e(t) y) gilt) Pn(E) 
i E = Y — g'01 — 9°62 — p" On 


where ø are the columns of the matrix ®. The least-squares problem can 
thus be interpreted as the problem of finding constants 61,...,0, such that 
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Figure 2.2 Geometric interpretation of the least-squares estimate. 


the vector Y is approximated as well as possible by a linear combination of the 
vectors g1, 9”,...,@7. Let Y be the vector in the span of g’, g?,...,9", which 
is the best approximation, and let £ = Y — Y. The vector £ is smallest when 
it is orthogonal to all vectors g’. This gives 


(T (¥-09'—e9"---- 0,9") =0 iht 


which is identical to the normal equation (2.5). The vector 0 is unique if the 
vectors g!,g,...,@” are linearly independent. 


Statistical Interpretation 


The least-squares method can be interpreted in statistical terms. It is then nec- 
essary to make assumptions about how the data has been generated. Assume 
that the process is 

y(i) = g" (O° + e(i) (2.12) 
where 6° is the vector of “true” parameters and {e(z),i = 1,2,...} is a sequence 
of independent, equally distributed random variables with zero mean. It is also 
assumed that e is independent of gy. Equation (2.4) can be written as 


Y=DO+E 
Multiplying by (®?@)-10? gives 
(Ta) oY = 6 = 6 + (TP) OTE (2.13) 


Provided that £ is independent of $7, which is equivalent to saying that e(i) 
is independent of (i), the mathematical expectation of Ê is equal to 6°. An 
estimate with this property is called unbiased. The following theorem is given 
without proof. 
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THEOREM 22 Statistical properties of least-squares estimation 


Consider the estimate in Eq. (2.6) and assume that data is generated from 
Eq. (2.12), where {e(i),i = 1,2,...} is a sequence of independent random vari- 
ables with zero mean and variance co”. Let E denote mathematical expectation 
and cov the covariance of a random variable. 

If &? @ is nonsingular, then 


(i) E@(t) = 6° 
(ii) cov O(t) = o? (HTP) 
(iii) &*(t) = 2V(6,t)/(t — n) is an unbiased estimate of a? 


where n is the number of parameters in @° and 6 and ¢ is the number of data 
points. Oo 

The theorem states that the estimates are unbiased, that is, E6(t) = 6°. 
Further, it is desirable that an estimate converge to the true parameter value 
as the number of observations increases toward infinity. This property is called 
consistency. There are several notions of consistency corresponding to different 
convergence concepts for random variables. Mean square convergence is one 
possibility, which can be investigated simply by analyzing the variance of the 
estimate. The result (ii) can be used to determine how the variance of the 
estimate decreases with the number of observations. This is illustrated by an 
example. 


EXAMPLE 2.2 Decrease of variance 


Consider the case in which the model in Eq. (2.12) has only one parameter. 
Let ¢ be the number of observations. It follows from (ii) of Theorem 2.2 that 
the variance of the estimate is given by 


a 
a) 
k=1 


Several different cases can now be considered, depending on the asymptotic 

behavior of g(k) for large k. Introduce the notation a ~ b to indicate that a 

and b are proportional. 

(a) Assume that g(k) ~ e-**,a > 0. The sum in the denominator above then 
converges, and the variance goes to a constant. 


(b) Assume that p(k) ~ k™,a > 0. Then 


cov 6 = 


const a > 0.5 


t 
S e'(h)~ 4 logt =a = 0.5 
Hel pm a05 


The variance goes to zero if a < 0.5. 
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(c} Assume that (k) ~ 1. The variance then goes to zero as 1/t. 
(d) Assume that o{k) ~ k*,a > 0. The variance then goes to zero as t~(+?9), 


(e) Assume that g(k) ~ e@*,a > 0. The variance then goes to zero as e", 
L 


The example shows clearly how the precision of the estimate depends on 
the rate of growth of the regression vector. The variance does not go to zero 
with increasing number of observations if the regression variable decreases 
faster than 1/,/t. In the normal situation, when the regressors are of the same 
order of magnitude, the variance decreases as 1/t. The variance decreases more 
rapidly if the regression variables increase with time. 

When several parameters are estimated, the convergence rates may be 
different for different parameters. This is related to the structure of the matriz 
(PTP! in Eq. (2.6). 


Recursive Computations 


In adaptive controllers the observations are obtained sequentially in real time. 
It is then desirable to make the computations recursively to save computation 
time. Computation of the least-squares estimate can be arranged in such a way 
that the results obtained at time t—1 can be used to get the estimates at time 
t. The solution in Eq. (2.6) to the least-squares problem will be rewritten in a 
recursive form. Let 6(f — 1) denote the least-squares estimate based on £ — 1 
measurements. Assume that the matrix ®7@ is nonsingular for all t. It follows 
from the definition of P(t) in Eq. (2.3) that 


P(t) = &7 (f(t) = > e(97() 


i=l 
-= Foig ) + p97 0) 
i=l 
= P(t — 1) + olto? (t) (2.14) 


The least-squares estimate 6(¢) is given by Eq. (2.9): 


ôl) = P) B PO) = P(t) B plidy(i) + PeO) 
i=1 i=1 


It follows from Eqs. (2.9) and (2.14) that 


i—i 


> e@y@ = PHE- DÂ- 1) = PAE- D - plo (HED 
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The estimate at time t can now be written as 


8(t) = 6( — 1) — P(e" OAE — 1) + POPEYE) 
= 6(t — 1) + Po) (vE) — 9? MA(E— 1) 
= 6(t — 1) + K(tje(t) 
where 
K(é) = Pe) 
elt) = x(t) — 97 (EE —1) 


The residual (t) can be interpreted as the error in predicting the signal y(¢) 
one step ahead based on the estimate 6(¢ — 1). 

To proceed, it is necessary to derive a recursive equation for P(t} rather 
than for P(t)~1 as in Eq. (2.14). The following lemma is useful. 


LEMMA 21 Matrix inversion lemma 


Let A, C, and C-!+ DAB be nonsingular square matrices. Then A+ BCD 
is invertible, and 


(A+ BCDY! = A+—A™B(C™ + DAB) DA“! 


Proof: By direct multiplication we find that 
(A+ BCD) (A“—A'B(C* + DA™"B)"DA”") 
= I+ BCDA™! — B(C1 + DATIBY IDA! 
— BCDA™“B(C 14 DAB) ŻDA! 
= I + BCDA™ — BC(C™ + DA™"B)(C1+DA'B)"DA" 
=f my 


Applying Lemma 2.1 to P(t) and using Eq. (2.14), we get 


P(t) = (®7()O()) = (7 ¢-1e(¢-1) + e(HE"(0) 
= (P(t— 1)" + oHe) 
= P(t—1) — P(t- Dolt) (I+ 9" HPE- Ne)” 7 APE- 1) 
This implies that 


K (t) = P(e(t) = P(t— ett) (2+ 97 (HPE- L)g(t)) 


Notice that a matrix inversion is necessary to compute P. However, the matrix 
to be inverted is of the same dimension as the number of measurements. That 
is, for a single output system it is a scalar. 

The recursive calculations are summarized in the following theorem. 
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THEOREM 2.3 Recursive least-squares estimation (RLS) 

Assume that the matrix P(t) has full rank, that is, $7 (¢)(t) is nonsingular, 
for all ¢ > fo. Given 4(é) and P(to) = (&*(to)@{to))—', the least-squares 
estimate 6(t) then satisfies the recursive equations 


6(£) = G(t — 1) + K (£) (y(t) — 9? (H(t — 1) (2.15) 

K (t) = P(@)g(t) = P(t— Ye) (1 + 97 HAPE — Delt) (2.16) 
P(t) = P(t — 1) — PE- Del + OPE- Nels) 97 OP(t-D 

= (I — K(#)97 (£))P(t— 1) (2.17) 

m 


Remark 1. Equation (2.15) has strong intuitive appeal. The estimate 6(2) is 
obtained by adding a correction to the previous estimate 6(#—1). The correction 
is proportional to y(t) — o7 (¢)}6(é — 1), where the last term can be interpreted 
as the value of y at time ¢ predicted by the model of Eq. (2.1). The correction 
term is thus proportional to the difference between the measured value of 
y(t) and the prediction of y(t) based on the previous parameter estimate. The 
components of the vector K(£) are weighting factors that tell how the correction 
and the previous estimate should be combined. 
Remark 2. The least-squares estimate can be interpreted as a Kalman filter 
for the process 

a(t +1) 0) (2.18) 

y(t} = p (4)8(2) + e&t) 

Remark 3. The recursive equations can also be derived by starting with the 
loss function of Eq. (2.2). Using Eqs. (2.8) and (2.6) gives 


2V(0,t) = 2V (0,t — 1) + £7(6,t) 
= ¥?(¢—1) (I-@(¢-1)(@7(¢-o(e—1))-1)) Y- 1) 
+ (0 —6(£—1))' T(t- 1) — 1)(@ — êt — 1)) 
+ (y(t) — 97 (0) (10) — 97 (8)9) (2.19) 


The first term on the right-hand side is independent of 0, and the remaining 
two terms are quadratic in @. V{@,£) can then easily be minimized with respect 
to 0. o 


Notice that the matrix P(#) is defined only when the matrix 7 (z)®(t) is 
nonsingular. Since 


t 
7e) = XJ eiea 
i=1 
it follows that 7 is always singular if t < n. To obtain an initial condition 
for P, it is thus necessary to choose £ = tp such that %7 (t)) (io) is nonsingular. 
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The initial conditions are then 


P(to) = (©7 (to) (to) 
6(to) = P(to)®? (to) ¥ (to) 


The recursive equations can then be used for £ > to. It is, however, often 
convenient to use the recursive equations in all steps. If the recursive equations 
are started with the initial condition 


where Po is positive definite, then 
P(t) = (Pat + ©? (f(t) 


Notice that P(ż) can be made arbitrarily close to (7 (£)@(t))* by choosing Po 
sufficiently large. 

By using the Kalman filter interpretation of the least-squares method, 
it may be seen that this way of starting the recursion corresponds to the 
situation in which the parameters have an initial distribution with mean ĝo 
and covariance Po. 


Time-Varying Parameters 


In the least-squares model (2.1) the parameters 8? are assumed to be constant. 
In several adaptive problems it is of interest to consider the situation in which 
the parameters are time-varying. Two cases can be covered by simple exten- 
sions of the least-squares method. In one such case parameters are assumed to 
change abruptly but infrequently; in the other case the parameters are chang- 
ing continuously but slowly. The case of abrupt parameter changes can be cov- 
ered by resetting. The matrix F in the least-squares algorithm (Theorem 2.3) 
is then periodically reset to aI, where a is a large number. This implies that 
the gain K (ż) in the estimator becomes large and the estimate can be updated 
with a larger step. A more sophisticated version is to run n estimators in par- 
allel, which are reset sequentially. The estimate is then chosen by using some 
decision logic. (See Chapter 6.) The case of slowly time-varying parameters can 
be covered by relatively simple mathematical models. One pragmatic approach 
is simply to replace the least-squares criterion of Eq. (2.2) with 


v(e) = 5 4 (6-97 )" (2.20) 
i=l 


where A is a parameter such that 0 < A < 1. The parameter A is called the 
forgetting factor or discounting factor. The loss function of Eq. (2.20) implies 
that a time-varying weighting of the data is introduced. The most recent data 
is given unit weight, but data that is n time units old is weighted by A”. The 
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method is therefore called exponential forgetting or exponential discounting. 
By repeating the calculations leading to Theorem 2.3 for the loss function of 
Eq. (2.20), the following result is obtained. 


THEOREM 24 Recursive least squares with exponential forgetting 


Assume that the matrix ®(t} has full rank for t > fọ. The parameter 8, which 
minimizes Eq. (2.20), is given recursively by 


6(t) = 6(¢ — 1) + K (Ð (y(t) — 97 (6(t — 1)) 
K(é) = P()e(t) = Pt- ge) (A+ e7@)PE-Ve)) (220) 
P(t) = (1—K(t)p*()) P(t — 1) /å o 


A disadvantage of exponential forgetting is that data is discounted even 
if P(t)g(t) = 0. This condition implies that y(t) does not contain any new 
information about the parameter @. In this case it follows from Eqs. (2.21) 
that the matrix P increases exponentially with rate A. Several ways to avoid 
this are discussed in detail in Chapter 11. 

An alternative method of dealing with time-varying parameters is to as- 
sume a time-varying mathematical model. Time-varying parameters can be 
obtained by replacing the first equation of Eqs. (2.18) with the model 


O(t + 1) = &,6(t) + v(t) 


where ©, is a known matrix and v(t} is discrete-time white noise. The filtering 
interpretation of the least-squares problem given in Remark 2 of Theorem 2.3 
can now easily be generalized. The least-squares estimator will then be the 
Kalman filter. The case ©, = I corresponds to a model in which the parameters 
are drifting Wiener processes. 


Simplified Algorithms 


The recursive least-squares algorithm given by Theorem 2.3 has two sets of 
state variables, Ê and P, which must be updated at each step. For large 
n the updating of the matrix P dominates the computing effort. There are 
several simplified algorithms that avoid updating the P matrix at the cost of 
slower convergence. Kaczmarz's projection algorithm is one simple solution. To 
describe this algorithm, consider the unknown parameter as an element of R". 
One measurement 

y(t) = p" (t)0 (2.22) 
determines the projection of the parameter vector 8 on the vector g(t). From 
this it is immediately clear that n measurements, where ¢(1),...,@(m) span 
R” , are required to determine the parameter vector 6 uniquely. Assume that an 
estimate 6(t—1) is available and that a new measurement such as Eq. (2.22) is 
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obtained, Since the measurement y(t} contains information only in the direction 

ọ(t) in parameter space, it is natural to choose as the new estimate the value 
6(£) that minimizes ||6(t) — 6(¢ — 1)|| subject to the constraint y(t} = 7 (t)6(t). 
Introducing a Lagrangian multiplier @ to handle the constraint, we thus have 
to minimize the function 


v= 5 (6(e) — ôe- D)” (6@ — ôE- D) + (W — (6) 
Taking derivatives with respect to @(t) and &, we get 


6(t) — 6(t — 1) — &g(t) = 0 
y(t) — 97 (êt) = 0 


Solving these equations gives 


A 5 g(t) T 
A(t) = 0t — 1) + y(t) — o* (HOE -—1 2.23 
(0) = 61) + pag CO- E-D) (2.28) 
The updating formula is called Kaczmarz’s algorithm. It is useful to be able to 
change the step length of the parameter adjustment by introducing a factor y. 
This gives 

Â A v(t) TryG 

ô = 6-1) + S$ (yA — 7? (QO — 1 

© = ôe- D + hy OO — 976-0) 

To avoid a potential problem that occurs when g(t) = 0, the denominator in 
the correction term is changed from 97 (t)g(é) to g7(#)o(t) + &, where æ is a 
positive constant. The following algorithm is then obtained. 


ALGORITHM 21 Projection algorithm 
A A rolt) TinÂ 
Ət) = O — 1) + —— (t) tjo —=1 2.24 
(0) = 8-1) + zr g O-O) 224) 
where œ > 0 and 0 <y <2. m 


Remark 1. In some textbooks this is called the normalized projection algo- 
rithm. 

Remark 2. The bound for the parameter y is obtained from the following 
analysis. Assume that data has been generated by Eq. (2.22) with parameter 
ð = 6°. It then follows from Eq. (2.24) that the parameter error 


6-6-6 


satisfies the equation 
O(t) = A()6(t — 1) 
where a) Te) 
YPE 
AD =T- toT 
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The matrix A(t) has one eigenvalue, 
a- 2+0- nep 
= T 
a+ gp 
This value is less than 1 in magnitude if 0 < y < 2. The other eigenvalues of 
A are all equal to 1. oc 


The projection algorithm assumes that the data is generated by Eq. (2.22) 
with no error. When the data is generated by Eq. (2.12) with additional random 
error, a simplified algorithm is given by 

Ô) = O(t — 1) + Pe) (x — 9? (A(t — D) (2.25) 
where 
t -1 
P(t) = (£ 090] (2.26) 
i=1 
This is the stochastic approximation (SA) algorithm. Notice that P(t) = bd? 
is now a scalar when y(t) is a scalar. A further simplification is the least mean 
square (LMS) algorithm in which the parameter updating is done by using 


G(t) = Ê(t — 1) + volt) (x@) — 97 (HÊ — 1) 
where y is a constant. 


Continuous-Time Models 


In the recursive schemes the variables have so far been indexed by a discrete 
parameter ¢. The notation £ was chosen because in many applications it de- 
notes time. In some cases it is natural to use continuous-time observations. 
It is straightforward to generalize the results to this case. Equation (2.1) is 
still used, but 2 is now assumed to be a real variable. Assuming exponential 
forgetting, the parameter should be determined such that the criterion 


V(6) = f eal- (y(t) — p7 (1)0) de (2.27) 


is minimized. The parameter a, where a > 0, corresponds to the forgetting 
factor A in Eq. (2.20). A straightforward calculation shows that the criterion 
is minimized if (see Problem 2.15 at the end of the chapter) 


É t 
( f eV 9(z) 97 (2) dr) A(t) = f eV (r)y(2) dr (2.28) 
which is the normal equation. The estimate is unique if the matrix 
t 
R(t) = f e@') o(z) 7 (2) dz (2.29) 
0 


is invertible. It is also possible to obtain recursive equations by differentiating 
Eq. (2.28). The estimate is given by the following theorem. 
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THEOREM 25 Continuous-time least-squares estimation 


Assume that the matrix R(t) given by Eq. (2.29) is invertible for all £. The 
estimate that minimizes Eq. (2.27) satisfies 


a) = P(t)p(t)e(t) (2.30) 

e(t) = y(t) — 97 (D(t) | (2.31) 

TO = P(t) — POl (PLO) (2.32) 

Proof: The theorem is proved by differentiating Eq. (2.28). O 


Remark 1. The matrix R(t) = P(é)~' satisfies 


RO _ aR() + pOT 


Remark 2. There are also continuous-time versions of the simplified algo- 
rithms. The projection algorithm corresponding to Eqs. (2.25) and (2.26) is 
given by Eq. (2.30) with 


Po=([ eels) ac) 


where P(t) is now a scalar. Oo 


2.3 ESTIMATING PARAMETERS IN DYNAMICAL SYSTEMS 


We now show how the least-squares method can be used to estimate parameters 
in models of dynamical systems. The particular way to do this will depend on 
the character of the model and its parameterization. 


Finite-Impulse Response (FIR) Models 


A linear time-invariant dynamical system is uniquely characterized by its im- 
pulse response. The impulse response is in general infinite-dimensional. For 
stable systems the impulse response will go to zero exponentially fast and may 
then be truncated. Notice, however, that a large number of parameters may be 
required if the sampling interval is short in comparison to the slowest time con- 
stant of the system. This results in the so-called finite impulse response (FIR) 
model, which is also called a transversal filter. The model can be described by 
the equation 


y(t) = byu(t — 1) + beu(é — 2) +---+ b,u(t — n) (2.33) 
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y(t) =p" (¢-1)e 


eT =(b ... bn) 


g?(t-1)=(u(t-1) ... u(t—n)) 

This model is identical to the regression model of Eq. (2:1), except for the 
index ¢ of the regression vector, which is different. The reason for this change 
of notation is that it will be convenient to label the regression vector with 
the time of the most recent data that appears in the regressor. The model of 
Eq. (2.33) clearly fits the least-squares formulation, and the estimator is then 
given by Theorem 2.3. 

The parameter estimator can be represented by the block diagram in 
Fig. 2.8. The estimator may be regarded as a system with inputs u and y 
and output @. Since the signal 


p(t) = ba(t — u(t — 1) +- + nlt — Dut — n) 


is available in the system, we can also consider f(t) as an output. Since f(t} is 
a predicted estimate of y, the recursive estimator can also be interpreted as an 
adaptive filter to predict y. The use of this filter is discussed in Chapter 18. 


where 


Transfer Function Models 


The least-squares method can be used to identify parameters in dynamical 
systems. Let the system be described by the model 


A(q)y(£) = B(q)u(é) (2.34) 
where q is the forward shift operator and A(gq) and B(g) are the polynomials 
Ald) =g +a +.. Han 
B(q) = brg"! + bog? +... + bm 


Adjustment 
mechanism 


Figure 2.3 Block diagram representation of a recursive parameter estima- 
tor for an FIR model. 
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Equation (2.34) can be written as the difference equation 
y(t} + ayy(t—1) +- -+ an9(t n) = bult +m—n—1)+---+b,ult—n) 


Assume that the sequence of inputs {u(1),u(2), ...,u(é)} has been applied to 
the system and the corresponding sequence of outputs {y(1), y(2), .. -, y(£)} has 
been observed. Introduce the parameter vector 


OF =(a, ... an bi ... bm) (2.35) 
and the regression vector 
eg  (t-1) =(—v(t-1) ... -x(t-—n) ult+m—n-1) ... u(t—n)) 


Notice that the output signal appears delayed in the regression vector. The 
model is therefore called an autoregressive model. The way in which the ele- 
ments are ordered in the matrix @ is, of course, arbitrary, provided that ¢(t—1) 
is also similarly reordered. Later, in dealing with adaptive control, it will be 
natural to reorder the terms. The convention that the time index of the @ vec- 
tor will refer to the time when all elements in the vector are available will also 
be adopted. The model can formally be written as the regression model 


y(t) = 9" (t—1)8 


Parameter estimates can be obtained by applying the least-squares method 
(Theorem 2.1). The matrix @ is given by 


g” (n) 
p = : 
g*(t—-1) 

If we use the statistical interpretation of the least-squares estimate given by 
Theorem 2.2, it follows that the method described will work well when the 


disturbances can be described as white noise added to the right-hand side of 
Eq. (2.34). This leads to the least-squares model 


Alq)y(t) = B(g)u(é) + e(t +n) 
(Compare with Eq. (2.12).} The method is therefore called an equation error 
method. A slight variation of the method is better if the disturbances are 


described instead as white noise added to the system output, that is, when 
the model is 
B(q) 


y(t) = Ala u(t) + e(é) 


The method obtained is then called an output error method. To describe such a 
method, let u be the input and 7 be the output of a system with the input-output 
relation 


IE) + a(t — 1) + --- +a, H(t —n) = byu +m—n—1)+---+6b,u(t—n) 
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that is, 


Me) = FS ull 


Determine the parameters that minimize the criterion 
t 
an yy2 
XO Ok) — (2) 
k=1 


where y(t} = $(£) + e(t). This problem can be interpreted as a least-squares 
problem whose solution is given by 


6(t) = O(t — 1) + P(A) g(t — Leli 
where 
p` (f-1)=(-Ht—-1) ... —ş(t—-n) ut+m-n—1) ... u(t—n)) 
e(t) = y(t) — 97 (t -—1)ôt-— 1) 


Compare with Theorem 2.1. The recursive estimator obtained can be repre- 
sented by the block diagram in Fig. 2.4. 


Continuous-Time Transfer Functions 


We now show that the least-squares method can also be used to estimate 
parameters in continuous-time transfer functions. For instance, consider a 
continuous-time model of the form 


dy dy _, du 
gyn + gpa Ho t any = br yt + Ome 
which can also be written as 
A(p)y(t) = B(p)u(2) (2.36) 


Adjustment 
mechanism 


Figure 2.4 Block diagram of a least-squares estimator based on the output 
error. 
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where A(p) and B(p) are polynomials in the differential operator p = d/dt. 
In most cases we cannot conveniently compute p” y(t) because it would involve 
taking n derivatives of a signal. The model of Eq. (2.36) is therefore rewritten 


A(p)y-(t) = B(p)u s(t) (2.37) 


yr (t) = Arp) 

u(t) = Hy (p)u(t) 
and Hr(p) is a stable transfer function with a pole excess of n or more. See 
Fig. 2.5. If we introduce 


where 


@=(a. ... Gn by ... bn)” 
p (t}=(—p™ typ... —yp Pup ... up) 
=(-p’"Hy(p)y ... —He(p)y p™Hy(p)u ... Hy(pyu) 


the model expressed by Eq. (2.87) can be written as 


pyr (t) = p”Hy(p) y(t) = 97 (98 
By a proper realization of the filter H; it is possible to use one filter to generate 
all the signals p'H;(p)y, i = 0, ..., n, and another filter to generate p'H;(p)u, 
i = 0,...,m — 1. Standard least squares can now be applied, since this is 
a regression model. A recursive estimate is given by Theorem 2.5. With the 
restriction on Hp there will not be any pure differentiation of the output or the 
input to the system. 


Nonlinear Models 


Least squares can also be applied to certain nonlinear models. The essential 
restriction is that the models be linear in the parameters so that they can be 


Parameter estimate f 


CO] Estimator —<—— H 
l m] 


p'H pmu p'H,y(p)y 


Process 
Input u Output y 


Figure 2.5 Block diagram of estimator with filters Hp. 
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written as linear regression models. Notice that the regressors do not need to 
be linear in the inputs and outputs. An example illustrates the idea. 


EXAMPLE23 Nonlinear system 
Consider the model 


y(t) + ay(t — 1} = byu(t — 1) + be sin(u(t)) 
By introducing 
@=(a by by)? 
and 
97 (t) = (v(t) uft) sin(u(t))) 
the model can be written as 
y(t) = g" (t— 1)8 


The model is linear in the parameters, and the least-squares method can be 
used to estimate @. 0 


Stochastic Models 


The least-squares estimate is biased when it is used on data generated by 
Eq. (2.12), where the errors e(i) are correlated. The reason is that Eg" (i)e(i) # 
0 (compare Eq. (2.18)}). A possibility to cope with this problem is to model the 
correlation of the disturbances and to estimate the parameters describing the 
correlations. Consider the model 


A(a)y(t) = B(q)u(t) + C(a)e(t) (2.38) 
where A(q), B(q), and C(qg) are polynomials in the forward shift operator 
and {e(t)} is white noise. The parameters of the polynomial C describe the 
correlation of the disturbance. The model of Eq. (2.38) cannot be converted 
directly to a regression model, since the variables {e(t}} are not known. A 
regression mode! can, however, be obtained by suitable approximations. To 
describe these, introduce 

elt) = y(t) — 97 (t—1)6(t— 1) 
where 


OS(ai sc Gy By son bn Cy cs Cn) 


gT (t-1) = Colt-1) ... —y(t—n) ult—-1) ... u(t~n) e(t—1) ... e(t—n)) 


The variables e(¢) are approximated by the prediction errors £(t). The model 
can then be approximated by 


y(t} = g” (t — 1)8 + e(t) 
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and standard recursive least squares can be applied. The method obtained is 
called extended least squares (ELS). The equations for updating the estimates 
are given by 

6(t) = 6(t — 1) + P(t)e(t — 1)e(t) 


P(t)}=P(t-l)+e¢-Ye7™e—-1) 


(Compare with Theorem 2.3.) Another method of estimating the parameters in 
Eq. (2.38) is to use Eqs. (2.39) and let the residual be defined by 


(2.39) 


C(a)e(t) = A(a)y(t) — Ê (aul) (2.40) 
and regression vector p in Eqs. (2.39) be replaced by øf, where 
C(a)gr(t) = p) (2.41) 


The most recent estimates should be used in these updates. The method ob- 
tained is then not truly recursive, since Eqs. (2.41) and (2.40) have to be solved 
from t = 1 for each measurement. The following approximations can be made: 
elt) = y(t) — 9F E- 1t- 1) 
This algorithm is called the recursive maximum likelihood (RML) method. 
It is advantageous for both ELS and RML to replace the residual in the 
regression vector by the posterior residual defined as 
Ep(t) = y(t) — p7 (t — 1) A(t) 


that is, the latest value of Ê is used to compute £p. 
Another possibility to model the correlated noise is to use the model 


v(t) = FE ule) + SE ett) 


instead of Eq. (2.38). Recursive parameter estimates for this model can be 
derived in the same way as for Eq. (2.38). 

Details about the extended least-squares method and the recursive maxi- 
mum likelihood method are found in the references at the end of the chapter. 


Unification 


The different recursive algorithms discussed are quite similar. They can all be 
described by the equations 


6(t) = O(¢ — 1) + P(t — L)e(t) 


_ P¢t—1)e(t—1)e* (t- 1)P(t- 1) 
PO) = 7 (Pe-2)- + oT (t—DPE—1ee—1) ) 


where 0, p, and € are different for the different methods. 


2.4 Experimental Conditions 63 


2.4 EXPERIMENTAL CONDITIONS 


The properties of the data used in parameter estimation are crucial for the 
quality of the estimates. For example, it is obvious that no useful parameter 
estimates can be obtained if all signals are identically zero. In this section we 
discuss the influence of the experimental conditions on the quality of the es- 
timates. In performing system identification automatically, as in an adaptive 
system, it is essential to understand these conditions, as well as the mecha- 
nisms that can interfere with proper identification. The notion of persistent 
excitation, which is one way to characterize process inputs, is introduced. In 
adaptive systems the plant input is generated by feedback. Difficulties caused 
by this are also discussed. 


Persistent Excitation 


Let us first consider estimation of parameters in a FIR model given by Eq. 
(2.33). The parameters of the model cannot be determined unless some condi- 
tions are imposed on the input signal. It follows from the condition for unique- 
ness of the least-squares estimate given by Theorem 2.1 that the minimum is 
unique if the matrix 


t t 


wk) S u(k—u(k—-2) ... So ulk—-Du(k—n) 
R X ulk- 1ju(k-2) Xu (k—2) ... $ ufk-2ju(k-n) 
`p = ati n+l n+l 
5 u(e—Au(e—n) Suen) 
n+ n+ (2.42) 


has full rank. This condition is called an excitation condition. For long data 
sets, all sums in Eq. (2.42} can be taken from 1 to £. We then get 


c(0) c(i) ... e(n—1) 
Lo e(1) c(0) ... e(n—2) 
Com fim OPO = | (2.48) 
ce(n—1) c(n—2) ... c(0) 


where c(k) are the empirical covariances of the input, that is, 


e(k} = lim © > vll = k) (2.44) 


64 Chapter 2 Real-Time Parameter Estimation 


For long data sets the condition for uniqueness can thus be expressed as 
the matrix in Eq. (2.43) being positive definite. This leads to the following 
definition. 


DEFINITION 21 Persistent excitation 


A signal u is called persistently exciting (PE) of order n if the limits (2.44) exist 
and if the matrix ©, given by Eq. (2.43) is positive definite. 

Remark 1. In the adaptive control literature an alternative definition of PE 
is often used. The signal u is said to be persistently exciting of order n if for 
all t there exists an integer m such that 


ttm 


al > X g{k)g? (k) > oot 


k=t 


where £1, 02 > 0 and the vector g(t) is given by 


g(t}=(u(t—1) u(t-—2) ... u(t—n)) 
Notice that the matrix (2.43} can be written as 


ię 
C, = lim 5 57 olko” (®) 
k=1 


Remark 2. Notice that no mean value is included in the definition of the 
empirical covariance c{k) in Eq. (2.44). o 


The following result can be established. 


THEOREM 26 Consistency for FIR models 


Consider least-squares estimation of the parameters of a finite impulse re- 
sponse model with n parameters. The estimate is consistent and the variance 
of the estimates goes to zero as 1/t if the input signal is persistently exciting 
of order n. 


Proof: The result follows from Definition 2.1 and Theorem 2.2. oO 
We now introduce the following theorem. 


THEOREM 27 Persistently exciting signals 
The signal u with the property (2.44) is persistently exciting of order n if and 
only if 
ig 2 
U = lim = > (A(q)u(k))? > 0 (2.45) 


for all nonzero polynomials A of degree n — 1 or less. 
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Proof: Let the polynomial A be 
A(q) = aog! + aig’? + +++ + ana 
A straightforward calculation shows that 


i ł - 
U = lim q 9 (aou(k +n —1) +-+- + dnau(h))? = a7 Caa 
k=1 


where Cn is the matrix given by Eq. (2.43). If C, is positive definite, the right- 
hand side is positive for all a, and so is the left-hand side. Conversely, if the 
left-hand side is positive for all a, so is the right-hand side. oO 


The result is useful in investigating whether special signals are persistently 
exciting. 


EXAMPLE 2.4 Pulse 


It follows from Eq. (2.45) that C, — 0 for all n if u is a pulse. A pulse thus is 
not PE for any n. D 


EXAMPLE 2.5 Step 
Let u(t) = 1 for t > 0 and zero otherwise. It follows that 


(qu) = 49 izo 


A step can thus at most be PE of order 1. Since 
1 t 
= 2 Z 
QG=7 Yu’ (k)=1 
k=1 
it follows that it is PE of order 1. oO 
EXAMPLE 2.6 Sinusoid 

Let u(t) = sin øt. It follows that 


(q? —2gcosm + 1) u(t) = 0 


A sinusoid can thus at most be PE of order 2. Since 


a= 5 l 1 = | 


2 | coso 1 


it follows that a sinusoid is actually PE of order 2. Oo 
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EXAMPLE 2.7 Periodic signal 
Let u(t} be periodic with period n, It then follows that 


(a° — 1) u(t) = 0 
The signal can thus at most be PE of order n. G 


EXAMPLE28 Random signals 
Consider the stochastic process 


u(t) = H(q)e(£) 


where e{£) is white noise and H (q) is a pulse transfer function. It follows from 
the definition of white noise that Eq. (2.45) is satisfied for the signal e for any 
nonzero polynomial A(g). This property also holds for the signal u. The signal 
u is thus PE of any order. D 


To be able to give a frequency domain interpretation of PE, it is useful to 
use the following theorem, which is given without proof. 


THEOREM 28 Parseval’s theorem 
Let 


H(g") = So hq” 
k=0 


Gig") = >> oa 
k=0 


be two stable transfer functions, and let e(t) be white noise of zero mean and 
covariance o°, Then 


g? X bags = a L H(e)G(e) da 


k=0 
m 
Remark. The left-hand side can be interpreted as E (H(q’)e(t) - G(ghe(t)), 


that is, the covariance of the two signals obtained by sending white noise 
through the transfer functions H(qg-1) and G(q7?). m) 


EXAMPLE 2.9 Frequency domain characterization 


Consider a quasi-stationary signal u(t) with spectrum $,{w). It follows from 
Parseval’s theorem that 


Him DAUD =, [Ae aodo (246) 
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This equation gives considerable insight into the notion of persistent excitation. 
A polynomial of degree n—1 can at most vanish in n—1 points. The right-hand 
side of Eq. (2.46) will thus be positive if ®,(@) # 0 for at least n points in the 
interval —z < œ < x. A signal whose spectrum is different from zero in an 
interval is thus persistently exciting of any order. (m 


A sinusoid has a point spectrum that differs from zero at two points. It is 
thus persistently exciting of second order. A signal that is a sum of k sinusoids 
is persistently exciting of order 2k. The frequency domain characterization also 
makes it possible to derive the following result. 


THEOREM 29 PEof filtered signals 


Let the signal u be persistently exciting of order n. Assume that A(g) is a 
polynomial of degree m < n. The signal v defined by 


u(t) = A{gju(t) 


is then persistently exciting of order £ with n-m < £ < n. Assuming that A is 
stable, the signal w defined by 


1, 
A(q) 


is persistently exciting of order n. o 


w(t) = 


(t) 


Transfer Function Models 


The properties of parameter estimates for discrete-time transfer functions will 
now be discussed. The uniqueness of the estimates will first be explored. For 
this purpose it is assumed that the data is actually generated by 


A’ (q)y(t) = B(q)u(t) + e(t + 2) (2.47) 


where A? and B® are relatively prime. Let A and B be the estimates of A? 
and B°, respectively. If e = 0, deg A > deg A’, and deg B > deg B®, it follows 
from Theorem 2.1 that the estimate is not unique because the columns of the 
matrix & are linearly dependent. However, we have the following result. 


THEOREM 2.10 ‘Transfer function estimation 

Consider data generated by the model of Eq. (2.47), with A? stable and e = 0. 
Let the parameters of the polynomials A and B be fitted by least squares. 
Assume that the input u is persistently exciting of order deg A+ deg B +1. If it 
is further assumed that deg A = deg A? and deg B > B®, then liM ®’ B/t 
is positive definite. 
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Proof: Consider 
1 i 2 
oT um + Ht -Emt T 
V({8)=0 lim 5 ® $e lim = > (p* {k)@) 


Introduce 
u(t) = p” (t +n —1)6 = B(q)u{t) — (Ala) — 9") y(t) 


= B(g)u(t) — a B°(g)u(t) 


= {A°(q@)B(Q) — (Ala) — @") B°@)} gay t 


Since A? is stable, it follows from Theorem 2.9 that the signal 1/A°(q) - u(t) is 
persistently exciting of order deg A + deg B +1. Since the polynomial in braces 
has a degree lower than or equal to deg A +-deg B, it follows that the signal v(t) 
does not vanish in the mean square sense unless the polynomial is identically 
zero. This happens if 

B° (a) _ _B(a) 


A'a) Ala)- a 


Since deg A = deg A°, the denominator on the right-hand side thus has degree 
deg A — 1 = deg A’ — 1. The rational functions are then not identical, and the 
theorem is proved. Oo 


Remark 1. Notice that deg A+ deg B +1 is equal to the number of parameters 
in the model of Eq. (2.47). The order of PE required is thus equal to the number 
of estimated parameters. 
Remark 2. If the data is generated by Eq. (2.47), where {e(t)} is white noise 
{i.e., a sequence of uncorrelated random variables), then the matrix 
. li» 

lim — @* p 

too É 
is positive definite for models of all orders provided that the input is persis- 
tently exciting of order deg B + 1. D 


Theorem 2.2 does not automatically apply to estimation of parameters of 
a transfer function, because the output y appears in the regression vector. 
A consequence of this is that theoretical properties of the estimates can be 
established asymptotically only for large number of observations. 


Identification in Closed Loop 


In adaptive control, system identification is often performed under closed-loop 
conditions, which may give rise to certain difficulties. Consider, for example, 
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the estimation of the coefficients of a transfer function model as in Eq. (2.34). 
The matrix ® is then 


—y({n)} <- —y{1) u(r) zea (4) 
—y(n+1) ... —y(2) u(n+1) ... u(2) 

b= |. (2.48) 
—=y(t=- 1) ... —y(t—-n) uft-—1)... ult—n) 


A linear feedback of sufficiently low order introduces linear dependencies 
among the columns of the matrix ©. This means that the parameters cannot 
be determined uniquely. A simple example shows what may happen. 


EXAMPLE 2.10 Loss of identifiability due to feedback 
Consider a system described by 
y(t + 1) + ay(t) = bu{t) (2.49) 


Assume that the parameters a and b should be estimated in the presence of 
the feedback 
u(t) = —ky(t) (2.50) 


Multiplying Eq. (2.50) by a and adding to Eq. (2.49) give 
y(t +1) + (a + ak) y{t) = (b — a)u(t) 
This shows that any parameters such that 
G=a+ak 
b=b—~a 


give the same input-output relation. The above equation represents a straight 
line i 
b=b+z(a—â) (2.51) 


in parameter space (see Fig. 2.6). The least-squares loss function (2.2) has the 
same value for all parameters on this line. Oo 


The problem with lack of identifiability due to feedback disappears if a lin- 
ear feedback of sufficiently high order is used. Then the columns of the matrix 
® given by Eq. (2.48) are no longer linearly dependent. Another possibility is 
to have a time-varying feedback. For example, in Example 2.10 it is sufficient 
to have a feedback of the form 

u(t) = —ki y(t) — key(t — 1) 


with kg #0. Another possibility is to use a feedback law 
u(t) = —k{é)y(¢) 
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Slope -1/k 


Figure 2.6 Dlustration of lack of uniqueness in closed-loop identification. 


where & varies with time. For instance, in Example 2.10 it is sufficient to use 
two values of the gain. Each value of the gain corresponds to a straight line 
with slope —1/k in parameter space. Two lines give a unique intersection. 

In adaptive systems there is a natural time variation in the feedback 
because the feedback gains are based on parameter estimates. In a typical case 
the variance of the parameters decreases as 1/#, but more complex behavior is 
also possible. The following example shows what can happen. 


EXAMPLE 211 Convergence rate 
Consider data generated by 
y(t) + ay{t — 1) = bu(t — 1) + e(t) 
with a feedback of the form 
u(t) = —k (1 + 9) y(t) (2.52) 
where {v({t)} is a sequence of independent random variables that are also 


independent of {e(t)}. With the feedback law of Eq. (2.52) the closed-loop 
system becomes 
ee 
t)teliti 
T Me) + el +1) 


Given measurements up to t+ 1, the matrix &’& of the estimation problem is 
Ł 


ZFU DUU) 


J=1 


You) Dw) 


j=l 


+=- (a+ok+ 


plo = 


2.6 Simulation of Recursive Estimation 71 


It follows that 


Ł 


YIU) =k PU) — RY OZO w PU) s hto? 


j=l 
~~ oy 2 {Xr a, uU | s PUU) 
D= (Zro y A > ee ) 


a (20) + Ar) æ kh? oF (t+ a? loge) 


Hence for large t, 


t —kt 
oo wo? 
~ ey (a (aolen | 
The covariance matrix of the estimate is thus 
1 œ? 1 
2 ooto y3 
o? (aTe)! my 08 logt z klogt 
e o202 1 1 
klogt k? log t 


It now follows that 
3 _ _p) (eo) (1k)? w De 
cov (a-rb) =02(1 kh) (°) (1 —k) a 
POPE A S Tayl T OF 
cov (kâ +6) =o2(k 1) (To) (k 1) ~ Zotit 
The estimate will thus approach the line (2.51) at the rate 1/t. The estimate 
will then converge to the correct values at the rate 1/logt. The convergence 


along the line (2.51) is slower than convergence toward the line. D 


2.5 SIMULATION OF RECURSIVE ESTIMATION 


In this section, different properties of the recursive least-squares (RLS) method 
are illustrated through simulations. Throughout the section, data is generated 
by 

y(t) + ay(t — 1) = but — 1) + e(t} + ce(t — 1) (2.53) 
where a = —0.8,b = 0.5, and e(t) is zero mean white noise with standard 
deviation o = 0.5. Furthermore, c = 0, P{0) = 100. I, and @(0) = 0 except 
when indicated. In most cases we use 


ô= (5) oe-D=ie-) we) 


Only in Example 2.13 is the parameter c estimated. 
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EXAMPLE 2.12 Excitation 


The need for persistency of excitation is illustrated in this example. A simu- 
lation of the estimates when the input is a unit pulse at t = 50 is shown in 
Fig. 2.7(a). The estimate G appears to converge to the correct value, but the 
estimate b does not. The reason for this is that information about the a param- 
eter is obtained through the excitation by the noise. Information about the b 
parameter is obtained only through the pulse that is not persistently exciting. 

In Fig. 2.7(b) the experiment is repeated, but the input is now a square 
wave of unit amplitude and a period of 100 samples. Both â and 6 will converge 
to their true values, because the input is persistently exciting. The absolute 
values of the elements of P(t} are decreasing with time. For the simulation in 
Fig. 2.7(b) we have 


(1000) —0.796 0.550 1.114 B 
( (1000) ) = | 0.511 ) ELOI l 1.114 3.258 ) a 
According to Theorem 2.2 this implies the following standard deviations for the 

estimates: 
ga = 0.5V5.50- 107? = 0.012 
a; = 0.5V32.58 - 10° = 0.029 
The estimates are thus well within one standard deviation of their true values. 
a) 


0 200 400 600 800 1000 


0 200 400 600 800 1000 
Time 


Figure 2.7 The estimated (solid line) and true (dashed line) parameter 
values in estimating the parameters in (2.53). The input signal u(£) is (a) a 
unit pulse at ¢ = 50, (b) a unit amplitude square wave with period 100. 
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EXAMPLE 2.13 Model structure 


In this example, parameter c in Eq. (2.53) has the value —0.5. Figure 2.8(a) 
shows the estimates of parameters a and b. The estimates do not converge 
to their true values. This is because the equation error e(t} + ce(t — 1) is not 
white noise. The assumptions in Theorem 2.2 are thus violated. Figure 2.8{b) 
shows the estimates when the extended least squares (ELS) method is used. 
All three parameters a, b, and c are then estimated, and the estimates converge 
to the true values, When only a and 6 are estimated by using the least-squares 
method, the estimates and the P-matrix at time t = 1000 are 


a(1000 —0. 710 1. 
| ao) = | ol pao = (07% 1435) 49-3 
(1000) 0.697 1.435 3.903 


The elements in the P-matrix are small. This would indicate good accuracy 
if the process had fulfilled the assumptions about the noise structure. Theo- 
rem 2.2 gives the following estimates of the standard deviation of d and b: 


ca = 0.5/7.10 - 107° = 0.013 
o; = 0.539.038 - 107? = 0.031 


It is thus deceptive to judge the accuracy of the estimates by only looking at 
the P-matrix. It is necessary that the data be generated from a model of the 
form (2.12) to use the P-matrix for accuracy estimates. 


gg a een 
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Figure 2.8 Estimated parameters when the model (2.53) is simulated with 
c = —0.5 by using (a) LS and (b) ELS. 


74 Chapter 2 Real-Time Parameter Estimation 


0 200 400 600 800 1000 


0 200 400 600 800 1000 
Time 


Figure 2.9 Estimates when the control signal is generated through feedback 
(a) u(t) = —0.2y(£) and (b) u(t) = —0.82y(t — 1). 


If we did not observe that the equation error is not white, we could thus 
be strongly misled. One possibility to avoid mistakes is to also compute the 
correlation of the equation error and check whether it is white noise. o 


EXAMPLE 2.14 Closed-loop estimation 


Example 2.10 showed that identifiability can be lost owing to feedback. The 
estimates when the input is generated through the feedback 


u(t) = —0.2y(£) 


are shown in Fig. 2.9(a). The estimates converge to the wrong values. Notice, 
however, that the estimates are on the straight line (2.51). In Fig. 2.9(b) the 
feedback is more complex: 


u(t) = —0.32y(t — 1) 


The two control laws give approximately the same speed and output variance 
of the closed-loop system. Identifiability is now regained, and the estimates 
converge to the correct values. The phase plane, that is, 6 as a function of 
â, is shown in Fig. 2.10 for different initia! conditions when u(t} = —0.2y(£). 
The initial value of the P-matrix is P(0) = 0.017, and 20,000 steps have been 
simulated for each initial condition. The estimates converge to the identifiable 
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-2 -1 0 1 4 


Figure 2.10 Phase plane of the estimates when the system (2.53) is sim- 
ulated for different initial conditions when u(t) = —0.2y(t). The dashed line 
shows the identifiable subspace. The dot shows the true parameter values. 


subspace determined by 
é+0.26+0.7=0 


(Compare Eq. (2.51).) This line is dashed in the phase plane. The estimates 
are approaching the identifiable subspace along straight lines. The same initial 
conditions are simulated for the control law u{t) = —0.32y¥{¢ — 1) in Fig. 2.11. 
The estimates converge to the correct value (—0.8, 0.5), independent of the 
initial values, oO 


EXAMPLE 2.15 Influence of forgetting factor 
The recursive least-squares algorithm (2.21) has a forgetting factor 4. The 
influence of the forgetting factor is shown in Figure 2.12, When A = 1, the 
estimates become smoother and smoother, since the gain K(t) goes to zero. 
When A < 1, the estimator gain K(f} does not go to zero, and the estimates 
will always fluctuate. The fluctuations increase with decreasing 4. As a rule 
of thumb the “memory” of the estimator is 


nA 
1—A 


For A = 0.99 the estimates are based on approximately the last 200 steps. G 


N= 
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Figure 2.11 Phase plane of the estimates when the system (2.53) is simu- 
lated for different initial conditions when u(t) = —0.32y(t—1). The dot shows 
the true parameter values. 


- EXAMPLE 2.16 Different estimation methods 
In the previous examples the RLS and ELS methods were used. Simplified 
estimation methods based on projection were discussed in Section 2.2. Three 
different projection algorithms will now be compared with the RLS method. All 
have the following form: 

6(t) = 6(f — 1) + Pl) (y(t) — 9? HÂ- 1) (2.54) 


Compare with Eq. (2.24). The scalar gain P(t) is given by the following algo- 
rithms. 


Least mean squares (LMS): 
Pi) =y 
Projection algorithm (PA): 


Y 
P(t) = —— e>o0, O<y<2 
O= a+ FOO : 
Stochastic approximation (SA): 


me Y 
PO = OA 
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Figure 2.12 Estimates of the parameters in the process (2.53) when RLS 
is used and {a} A = 1, (b) A = 0.999, (c) A = 0.99, and (d} A = 0.95. 


The convergence properties of the four algorithms RLS, LMS, PA, and SA are 
compared in Fig. 2.13. All algorithms are initialized with @(0) = 0. The RLS 
method in Fig. 2.13(a) uses P(0) = 1007 and A = 1. Notice that the estimates 
move very quickly initially. The LMS method used in Fig. 2.13(b) has a constant 
gain y = 0.01. The estimates approach values that are close to the correct ones 
relatively quickly, but the estimates do not converge, since the gain is not 
decreasing. 

In the PA method in Fig. 2.13(c) the gain is normalized with o? (t)ọ (t). 
Further, @ = 0.1 and y = 0.01 are used. The approach toward the correct 
values is slower than for the LMS algorithm. However, the PA method is less 
sensitive than the LMS method to the size of the signals. 

The SA method is used in Fig. 2.13(d) with y = 0.2, and the estimates con- 
verge to the correct values even if the convergence is very slow. About 25,000- 
50,000 steps are needed before the estimates are close to the correct values. 
From the simulations it is seen that the recursive least-squares method has 
superior convergence properties. The price for this is the increase in computa- 
tions. ao 


The examples show that there are many things that influence the perfor- 
mance of the estimators. In adaptive control it is important to remember that 
the estimation is done in closed loop. 
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Figure 2.13 The estimates of the parameters in the process for different 
estimation methods. (a) Recursive least squares (RLS) with P(0) = 100 and 
A = 1. (b} Least mean squares (LMS) with y = 0.01. (c) Projection algorithm 


(PA) with œ = 0.1 and y = 0.01. (d} Stochastic approximation (SA) with 
y = 0.2. 


2.6 PRIOR INFORMATION 


There is a significant advantage in incorporating available prior mformation. 
It reduces the number of parameters that have to be estimated, improves the 
precision of the estimates, and reduces the requirements on excitation. 

Prior information typically relates to properties of a model. It can, for 
instance, represent knowledge of time constants of an actuator. This type of 
knowledge is easy to incorporate in an indirect adaptive algorithm. However, 
it may be difficult to incorporate in a direct adaptive algorithm, since process 
parameters influence controller parameters in a complicated fashion. Since 
prior knowledge is often related to the continuous-time models it is easier to 
use for continuous time than for discrete time self-tuners. These properties are 
highlighted by a few examples. 


` EXAMPLE 217 Prior information in continuous time 
Consider the continuous-time system with the transfer function 
83 


G(s) = Ty es) + Ons) 
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The parameter @, is assumed to be known; 6a and ĝa are unknown. If we 
introduce the filtered signal č defined by 


_ 1 
i= FOE u 
the input-output relation may be written as 
dy : 
The estimation problem thus reduces to estimation of parameters 83 and @2 of 
the first-order system given by Eq. (2.55). D 


The example thus shows that it is straightforward to handle prior infor- 
mation for the continuous-time model. The next example illustrates some com- 
plications that occur when the model is sampled. 


EXAMPLE 2.18 Prior information in sampled models 


Consider the system in Example 2.17. Sampling the system with sampling 
period A gives the pulse transfer operator 


big + be 
Hq) = ——————_ 
(9) gq? + aig + a2 
where 
A, (1 — e) — 62 (1 — e*®) 
bı = 83 —- 3c 
ĝi — 8 
Boll — eia eo n g (1 — e hl) Alf 
pa g POTA) o (1— ehe 
8 — 8z 
ay = —(e 1% 4 e™ 2) 
ag = ent er+1/Ba)h 


The pulse transfer function is nonlinear in 6), 62, and 83. Further, both param- 
eters appear in all the coefficients of the discrete-time pulse transfer function. 
This implies that a change in the unknown time constant 82 will influence all 
the coefficients in the sampled data model. There is, however, some structure 
in the parameter dependence. The denominator polynomial can be written as 


g + ag + ag = (q — oH) (q — e) 
= (¢— 1) (q — a2) 


When @, is known, one factor of A(g) is thus known. By reparameterization 
the sampled model can be written as 


big + be 


AD (q —a1)(q — æa) 
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The prior information can be used to reduce the estimated parameters from 4 
to 3. Further simplifications can be made when the sampling interval is small 
in comparison with 6; and 62, A series approximation of bı and be in h gives 


83 L 8(8ı + 82) 


b h? h? 
1 26102 60202 

8s 2 43(614+ 42) 19 

ba © 96,05" oer 


For short sampling periods we have 


bi xb 
1 bem oe gh 


The model can now be described by 


k(q+1) 


B® = Gr an)(q— a) 


where parameter œ is known and &s and k = 63/(h?6162) are unknown. O 


The observation about the structure of the sampled model for small sam- 
pling periods is a consequence of a general result about how poles and zeros 
are transformed by sampling. If æ; is a pole of a continuous-time system, then 
the sampled system has a pole at exp(a@jh). 

There are no simple, exact formulas for transforming the zeros, For short 
sampling periods, however, a zero f; is approximately transformed to exp(f;h). 
If d is the pole excess of the continuous-time system, there will be d — 1 
additional zeros of the sampled system. The limiting positions of these zeros 
as the sampling period goes to zero are given by Theorem 6.9 in Chapter 6. In 
this way it is possible to use prior information in terms of poles and zeros both 
for continuous-time selftuners and for discrete-time self-tuners with a short 
sampling period. 

Example 2.18 shows that how the process model is parameterized is cru- 
cial. Different parameterizations can be attempted. This is illustrated by an 
example. 
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Figure 2.14 The circuit in Example 2.19. 


EXAMPLE 2.19 Reparameterization 
Consider the circuit in Fig. 2.14. The state space representation is 


E-ha aC): 


y=(0 R)x 


and the transfer function is 


= 
oe fo 
2 2 —— = 
PIFI 


Let Ay =F, 82 = L/L, and 03 = 1/C. Then 


018283 


G(s) = s? + 8,005 + 6203 


The coefficients are nonlinear (although of special structure) in the physical 
parameters R, 1/L, and 1/C. The system can be written as 


kı 
s? + kos + Rg 


and it is possible to make an estimation of 81, 62, and @3 by using Eq. (2.56). 
However, the estimates must be constrained such that the relations 


G(s) = (2.56) 


kı = 0189203 
ko = 6182 
ks = 0203 
are fulfilled. o 


For indirect self-tuning regulators it is possible to estimate the continuous- 
time process parameters from discrete-time measurements. The model can then 
be sampled and the controller designed for the chosen sampling interval 
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2.7 CONCLUSIONS 


In this chapter we have introduced recursive parameter estimation, which is 
a key ingredient in adaptive control. The presentation has been focused on 
the least-squares method, which is a simple but useful technique. In the next 
chapter we will show how the method is used in adaptive systems. System 
identification involves several important issues that we have not discussed. 
One is model validation; another is computational aspects. These issues are 
discussed in detail in Chapter 11. 


PROBLEMS 


2.1 Consider the function 
V(x) =x? Ax+b x+e 


where x and b are column vectors, A is a matrix, and c¢ is a scalar. Show 
that the gradient of function V with respect to x is given by 


grad, V =(A+A™)x+b 
This can be used to find the minimum of Eq. (2.7). 
2.2 Consider the FIR model 


y(t} = bou(t) + byu(t — 1) + e(t) i= 1,2,... 


where {e(¢}} is a sequence of independent normal N(0,c) random vari- 
ables. 


(a) Determine the least-squares estimate of the parameters bọ and bı 
when the input signal u is a step. Analyze the covariance of the 
estimate when the number of observations goes to infinity. Relate 
the results to the notion of persistent excitation. 


(b) Make the same investigation as in part (a) when the input signal is 
white noise with unit variance. 


2.3 Consider data generated by the discrete-time system 
y(t) = bou(t) + biu(t — 1) + e(t) 


where {e(t)} is a sequence of independent N(0,1) random variables. 
Assume that the parameter b of the model 


y(t) = bu(t) 
is determined by least squares. 


2.4 


2.5 


2.6 


2.7 


Problems 83 


(a) Determine the estimates obtained for large observation sets when the 
input u is a step. (This is a simple illustration of the problem of fitting 
a low-order model to data generated by a complex model. The result 
obtained will critically depend on the character of the input signal.) 

(b) Make the same investigation as in part (a) when the input signal is 
a sequence of independent (0,0) random variables. 


Determine which of the input signals below are persistently exciting of 
at least order 4. 
(a) 


u(t) = ap +a; sin øt a;#0, i=0,1 


(b) os 
= qh. 
ul) = Grade 08 
where v(t) is persistently exciting of order 5. 
(c) 
u(t) = —#— 95 _iyyy 


(a — 0.4)(g — 0.6) 


where v(t) has a spectrum (œ) that is not equal to zero in the 
interval 1 < @ < 2. 


Consider the discrete-time system 
y(t + 1) + ay(t) = bu(t) + e(é + 1) 


where the input signal u and the noise e are sequences of independent 
random variables with zero mean values and standard deviation o and 1. 
Determine the covariance of the estimates obtained for large observation 
sets. 


Consider data generated by the least-squares model 
y(é + 1) + ay(t) = bu(t) + e(t + 1) + ce(t) #=1,2,... 


where {u(é)} and {e(t)} are sequences of independent random variables 
with zero mean values and standard deviations 1 and o. Assume that 
parameters a and b of the model 


y(t + 1) + ay(t) = bu(t) 


are estimated by least squares. Determine the asymptotic values of the 
estimates. 


Consider least-squares estimation of the parameters bı and be in 


y(t) = byu(t) + beu(t — 1) 


84 


2.8 


2.9 


2.10 


2.11 
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Assume that the following measurements are obtained: 


E Li ¥ 
1 1000 — 
2 1001 2001 
3 1000 2001 


Discuss the numerical properties of computing the estimates directly and 
by the normal equations. 


Consider the model 
yi) =a+b-t+e(t)  Ł=1,2,3,... 


where {e(f)} is a sequence of uncorrelated N(0,1) random variables. 
Determine the least-squares estimate of the parameters a and b. Also 
determine the covariance of the estimate. Discuss the behavior of the 
covariance as the number of estimates increases. 


Consider the model in Problem 2.8, but assume continuous-time observa- 
tion, where e(t) is white noise, that is, a random function with covariance 
6(£). Determine the estimate and its covariance. Analyze the behavior of 
the covariance for large observation intervals. 


Consider data generated by 
y(t) = b + eft) £=1,2,...,N 


where {e(t);£ = 1,3,4,...} is a sequence of independent random variables. 
Furthermore, assume that there is a large error at t = 2, that is, 


e(2) =a 
where a is a large number. Assume that the parameter b in the model 


y(t) =b 


is estimated by least squares. Determine the estimate obtained, and 
discuss how it depends on a. (This is a simple example that shows how 
sensitive the least-squares estimate is with respect to occasional large 
errors.) 


Consider Example 2.12. Analyze the asymptotic properties of the P- 
matrix and explain the simulation in Figs. 2.7(a) and 2.7(b). 


2.12 Show that Eq. (2.11) minimizes the weighted least-squares loss function 


(2.10). 


Problems 85 


2.13 Consider Eqs. (2.21) with the initial condition 6(0) = @) and P(0) = Po. 


2.14 


2.15 


2.16 


Show that 6(£) minimizes the criterion 


t 


v(e, 9 = 5 >A (W — 9" (0)? + 5 (0 — 00)" Po? (0 — 80) 


i=l 
Compare Theorem 2.4. 
Consider the following model of time-varying parameters: 
A(é) = ©, A(t — 1) + v(t) 
y(t) = P” (O(t) + elt) 


where {v(i),é = 1,2,...} and {e(é),¢ = 1,2,...} are sequences of inde- 
pendent, equally distributed random vectors with zero mean values and 
covariances A, and Rz, respectively. Show that the recursive estimates 
of 6 are given by 


6(t) =(t — 1) + K (£) (y(t) — 9? (HÊ — 1) 
K(i) =®,P(t — 1)9(t— 1) (Ro + 9" (t — 1)PE — 1 g(t-1))~ 
P(t) =, P(t —1)07 + Ry 
— &,P(t— 1)e(t) (Re + 97 (P(E Ie) TOPE- Yet 


Show that Eq. (2.28) minimizes Eq. (2.27), and use this to prove Theo- 
rem 2.5, Hint: Use Remark 1 in Theorem 2.5 and that the time derivative 
of the identity J = PP! is 
dP P d(P-) 
di dt 
In an adaptive controller the process parameters are estimated according 
to the model 


P 


y(t) + ary(é — 1) + cey(t — 2) = bou(t — 1) + biu(t — 2) + e(t) 
The controller has the structure 
u(t) + riu(t — 1) = —soy(t) — sıy(t — 1) 


The reference value is thus zero. Consider the case in which the controller 
parameters are constant. 


(a) Show that the parameters ai, de, bp, and bı cannot be uniquely de- 
termined. 


(b) Characterize the parameter combinations that can be determined. 
(c) Show that with the controller structure 


u(t) +ryu(t — 1) +reu(t— 2) = —soy(t) — s1y(t — 1) — say(t — 2) 


all process parameters can be estimated uniquely. 


217 


2.18 


2.19 


2.20 
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Figure 2.15 Closed-loop estimation scheme for Problem 2.17. 


Figure 2.15 shows a closed-loop system for estimation of the unknown 
constant b in the pulse transfer function H (q) = b/(q +a). The constant 
a is known and is such that |a| > 1. This means that the open-loop system 
is unstable, and to have bounded signals for the estimation, we need to 
stabilize the system with a controller. This is done with a P controller 
with gain K such that |e + Kb| < 1. The estimator is a least-squares 
(LS) estimator that is based on the regression model 


Ht) = 9" (t- 1) 
where 
H(t) = y(t) + ay(t — 1) 
p(t—1) = u(t—1) 
@=b 


(a) Determine the asymptotic LS estimate of @ = b when d = 0 and {u,} 
is a sequence of independent, equally distributed random variables 
with zero mean and variance g? {i.e., te is a white noise signal). 

(b) Determine the asymptotic LS estimate of b when u, = 0 and d = dy = 
constant. 


(c) Discuss the case of least-squares estimation of 6 when u, is as in part 
(a) and d = do = constant. What should be done to avoid a biased 
estimate of b? 


Write a computer program to simulate the recursive least-squares esti- 
mation problem. Write the program so that arbitrary input signals can 
be used. Use the program to investigate the effects of initial values on 
the estimate. 


Use the program from Problem 2.18 to estimate the parameters a and b 
in Problem 2.6. Investigate how the bias of the estimate depends on c. 


Consider the estimation problem in Problem 2.6. Use the computer pro- 
gram developed in Problem 2.18 to explore what happens when the control 
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Signal u is generated by the feedback 


u(t) = —ky(t) 
Try to support your observations by analysis. 


2.21 Consider the open-loop system in Section 2.5 when c = 0. Let the input 
signal be a square wave with unit amplitude and a period of 100 sam- 
ples. Investigate through simulations the convergence and behavior of the 
parameter estimates when varying: 


(a) The initial value 6(0). 

(b) The initial value of the covariance matrix P(0). 
(c) The forgetting factor À. 

(d) The period of the input signal. 
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CHAPTER 3 


DETERMINISTIC 
SELF-TUNING REGULATORS 


3.1 INTRODUCTION 


Development of a control system involves many tasks such as modeling, de- 
sign of a control law, implementation, and validation. The self-tuning regula- 
tor (STR) attempts to automate several of these tasks. This is illustrated in 
Fig. 3.1, which shows a block diagram of a process with a self-tuning regulator. 
It is assumed that the structure of a process model is specified. Parameters of 
the model are estimated on-line, and the block labeled “Estimation” in Fig. 3.1 
gives an estimate of the process parameters. This block is a recursive estimator 
of the type discussed in Chapter 2. The block labeled “Controller design” con- 
tains computations that are required to perform a design of a controller with 
a specified method and a few design parameters that can be chosen externally. 
The design problem is called the underlying design problem for systems with 
known parameters. The block labeled “Controller” is an implementation of the 
controller whose parameters are obtained from the control design. 

The name “self-tuning regulator” comes from one of the early papers. 
The main reason for using an adaptive controller is that the process or its 
environment is changing continuously. It is difficult to analyze such systems. 
To simplify the problem, it can be assumed that the process has constant but 
unknown parameters. The term self-tuning was used to express the property 
that the controller parameters converge to the controller that was designed if 
the process was known. An interesting result was that this could happen even 
if the model structure was incorrect. 

The tasks shown in the block diagram can be performed in many differ- 
ent ways. There are many possible choices of model and controller structures. 
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' Self-tuning regul lator ee ] 
Specification | Process parameters | 


Controller 
design 


Controller 
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Process 


Figure 3.1 Block diagram of a self-tuning regulator. 


Estimation can be performed continuously or in batches. In digital implemen- 
tations, which are most common, different sampling rates can be used for the 
controller and the estimator, It is also possible to use hybrid schemes in which 
control is performed continuously and the parameters are updated discretely. 
Parameter estimation can be done in many ways, as was discussed in Chap- 
ter 2. There is also a large variety of techniques that can be used for control 
system design. It is also possible to consider nonlinear models and nonlinear 
design techniques. Although many estimation methods will provide estimates 
of parameter uncertainties, these are typically not used in the control design. 
The estimated parameters are treated as if they are true in designing the 
controller. This is called the certainty equivalence principle. 

The controller shown in Fig. 3.1 is thus a very rich structure. Only a few 
possibilities have been investigated. The choice of model structure and its pa- 
rameterization are important issues for self-tuning regulators. A straightfor- 
ward approach is to estimate the parameters of the transfer function of the 
process. This gives an indirect adaptive algorithm. The controller parameters 
are not updated directly, but rather indirectly via the estimation of the process 
model. 

Often, the model can be reparameterized such that the controller parame- 
ters can be estimated directly. That is, a direct adaptive algorithm is obtained 
(compare with the discussion of the direct MRAS in Section 1.4). There has 
been some confusion in the nomenclature. In the self-tuning context, indirect 
methods have often been called explicit self-tuning control, since the process 
parameters have been estimated. Direct updating of the controller parameters 
has been called implicit self-tuning control. In the early papers on adaptive con- 
trol a direct adaptive controller was often referred to as an adaptive controller 
without identification. It is convenient to divide the algorithms into indirect 
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and direct self-tuners, but the distinction should not be overemphasized. The 
basic idea in both types of algorithms is to identify some parameters that are 
related to the process and/or the specifications of the closed-loop system. 

The purpose of this chapter is to present the basic ideas and to illustrate 
some properties of self-tuning regulators. It is assumed that the process model 
and the controller are linear systems. The discussion will also be restricted 
to single-input, single-output (SISO) systems. In most cases we will assume 
that the controler is sampled and that estimation and control are performed 
with the same sampling rates. Recursive least squares will be used for param- 
eter estimation, and the design method is a deterministic pole placement. The 
reasons for these choices are mostly didactic; we would like to present simple 
methods that can be used in practice. Least-squares estimation was discussed 
in Chapter 2. In Section 3.2 we present the design method used in a simple 
setting. A straightforward combination of least-squares estimation and pole 
placement design gives an indirect self-tuning regulator. The sampled version 
is described in Section 3.3, and the continuous-time version is described in 
Section 3.4. In Section 3.5 we show how a direct self-tuning regulator is ob- 
tained. In this section we also discuss hybrid algorithms that combine features 
of direct and indirect algorithms. In Section 3.6 we discuss how to modify the 
adaptive controllers so that they can deal with disturbances. 


3.2 POLE PLACEMENT DESIGN 


A simple method for control design will now be presented. The idea is to deter- 
mine a controller that gives desired closed-loop poles. In addition it is required 
that the system follows command signals in a specified manner. This is a simple 
method that, properly applied, can give practically useful controllers as well as 
useful understanding of adaptive control. It is also the key to understand the 
similarities between the self-tuning regulator and the model reference adaptive 
controller. 


Process Model 


It is assumed that the process is described by the single-input, single-output 
(SISO) system 


A(g)y(t) = B(g) (u(t) + v(t)) 


where y is the output, u is the input of the process, and v is a disturbance. The 
disturbances can enter the system in many ways. Here it has been assumed 
that they enter at the process input. For linear systems in which the super- 
position principle holds, an equivalent input disturbance can always be found. 
Furthermore, A and B are polynomials in the forward shift operator q. The 
polynomials have the degrees deg A = n and deg B = deg A—dp. Parameter do, 
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Controller 


Figure 3.2 A general linear controller with two degrees of freedom. 


which is called the pole excess, represents the integer part of the ratio of time 
delay and sampling period. It is sometimes convenient to write the process 
model in the delay operator gq). This can be done by introducing the reciprocal 


polynomial 
A*(q*) = "A(q) 
where n = deg A. The model can then be written as 
A*(q")9(t) = B*(q™) (ult — do) + v(t — do)) 


where 

Atg) =1+ag +...+anq" 

B*(q*) = bo + big? +... + bmg” 
with m = n — do. Notice that since n was defined as the degree of the system, 
we have n > m + do, and trailing coefficients of A* may thus be zero. 

We will mostly deal with discrete time systems. Since the design method is 
purely algebraic, we can handle continuous systems simultaneously by writing 
the model 

Ay(t) = B (u(t) + v(8)) (3.1) 
where A and B denote polynomials in either the differential operator p = d/dé 
or the forward shift operator q. It is assumed that A and B are relatively prime, 
that is, that they do not have any common factors. Further, it is assumed that 
A is monic, that is, that the coefficient of the highest power in A is unity. 

A general linear controller can be described by 


Ru(t) = Tue(t) — S(t) (3.2) 
where #, S, and T are polynomials. This control law represents a negative 
feedback with the transfer operator —S/R and a feedforward with the transfer 
operator T/R. It thus has two degrees of freedom. A block diagram of the 


closed-loop system is shown in Fig. 3.2. Elimination of u between Eqs. (8.1) 
and (3.2) gives the following equations for the closed-loop system: 


BT BR 
“= Fry Bs + apy as O 29 
AT BS | 


u(t) = Fey Bs — ary Bs O 
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The closed-loop characteristic polynomial is thus 
AR + BS = A, (3.4) 


The key idea of the design method is to specify the desired closed-loop char- 
acteristic polynomial A.. The polynomials R and S can then be solved from 
Eq. (3.4). Notice that in the design procedure we consider polynomial A, to be 
a design parameter that is chosen to give desired properties to the closed-loop 
system. Equation (3.4), which plays a fundamental role in algebra, is called 
the Diophantine equation. It is also called the Bezout identity or the Aryab- 
hatta equation. The equation always has solutions if the polynomials A and 
B do not have common factors. The solution may be poorly conditioned if the 
polynomials have factors that are close. The solution can be obtained by intro- 
ducing polynomials with unknown coefficients and solving the linear equations 
obtained. The solution of the equation is discussed in detail in Chapter 11. 


Model-Following 


The Diophantine equation (3.4) determines only the polynomials F and S. 
Other conditions must be introduced to also determine the polynomial T in 
the controller (3.2). To do this, we will require that the response from the 
command signal u, to the output be described by the dynamics 


AmY¥m(t) = Brttc(t) (3.5) 
It then follows from Eqs. (3.3) that the folowing condition must hold: 
T BT Bn 
= (3.6) 


AR+BS A Am 
This model-following condition says that the response of the closed-loop system 
to command signals is as specified by the model (3.5). Whether model-following 
can be achieved depends on the model, the system, and the command signal. 
If it is possible to make the error equal to zero for all command signals, then 
perfect model-following is achieved. 

The consequences of the model-following condition will now be explored. 
Equation (3.6) implies that there are cancellations of factors of BT and A,. 
Factor the B polynomial as 

B= Bt B- (3.7) 


where B+ is a monic polynomial whose zeros are stable and so well damped 
that they can be canceled by the controller and B- corresponds to unstable or 
poorly damped factors that cannot be canceled. It thus follows that B- must 
be a factor of B,,. Hence 

Bm = B7B} (3.8) 


Since Bt is canceled, it must be a factor of A.. Furthermore, it follows from 
Eq. (3.6) that Am must also be a factor of Ae. The closed-loop characteristic 
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polynomial thus has the form 
A, = ApAn Bt (3.9) 


Since Bt is a factor of B and A,, it follows from Eq. (3.4) that it also divides 
E. Hence 


R = R’B* (3.10) 
and the Diophantine equation (3.4) reduces to 
AR’ + BTS = A,Am = Al (3.11) 
Introducing Eqs. (3.7), (3.8), and (3.9) into Eg. (3.6) gives 
T = A,B', (3.12) 


Causality Conditions 


To obtain a controller that is causal in the discrete-time case or proper in the 
continuous-time case, we must impose the conditions 


deg S < deg R 


deg T < deg R (3.13) 


The Diophantine equation (3.4) has many solutions because if R° and S° are 
solutions, then so are 
R=R°+@QB 
S=S°-@QA 


where @ is an arbitrary polynomial. Since there are many solutions, we may 
select the solution that gives a controller of lowest degree. We call this the 
minimum-degree solution. Since deg A > deg B, the term of highest order on 
the left-hand side of Eq. (3.4) is AR. Hence 


(3.14) 


deg R = deg A, — deg A 


Because of Eqs. (3.14) there is always a solution such that deg S < deg A =n. 
We can thus always find a solution in which the degree of S is at most deg A—1. 
This is called the mintmum-degree solution to the Diophantine equation. The 
condition deg S < deg R thus implies that 


deg A, > 2dezg A— 1 
It follows from Eq. (3.12) that the condition deg T < deg R implies that 
deg Am — deg B; > deg A — deg B* 


Adding deg B~ to both sides, we find that this is equivalent to deg Am — 
deg Em > do. This means that in the discrete-time case the time delay of the 
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model must be at least as large as the time delay of the process, which is a very 
natural condition. Summarizing, we find that the causality conditions (3.13) 
can be written as 


deg A, > 2degA—1 
deg Am — deg Bm > deg A — deg B = do 
It is natural to choose a solution in which the controller has the lowest possible 
degree. In the discrete-time case it is also reasonable to require that there be no 


extra delay in the controller. This implies that polynomials R, S, and T should 
have the same degrees. The following design procedure is then obtained. 


(3.15) 


ALGORITHM 3.1 Minimum-degree pole placement (MDPFP) 
Data: Polynomials A, B. 


Specifications: Polynomials Am, Bm, and A,. 


Compatibility Conditions: 
deg Am = deg A 
deg Bm = deg B 
deg A, = deg A — deg Bt — 1 
B, = B7B; 


Step 1: Factor B as B = BtB-, where B+ is monic. 
Step 2: Find the solution FR’ and S with deg S < deg A from 


AR’ + B-S =A,Ap 


Step 3: Form R = R'Bt and T = A,B}, and compute the control signal from 
the control] law 
Ru = Tu, — Sy 0 


There are special cases of the design procedure that are of interest. 


All zeros are canceled The design procedure simplifies significantly in the 
special case in which all process zeros are canceled; then deg A, = deg A — 
degB —1 = dy — 1. It is natural to choose Bm = A,,(1)qg*%. Then the 
factorization in Step 1 is very simple, and we get B~ = bo, Bt = B/bo. 
Furthermore, T = A,B}, = Aog & Am(1})/bo, and the Diophantine equation 
in Step 2 reduces to 

AR’ + boS = AL = ApAm 


This equation is easy to solve because F’ is the quotient and 5S is the remain- 
der when A,A,, is divided by A. However, all process zeros must be stable and 
well damped to allow cancellation. 
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No zeros are canceled The factorization in Step 2 also becomes very simple 
if no zeros are canceled. We have B+ = 1, B- = B, and B,, = BB, where 
P = A,,(1)/B(1). Furthermore, deg A, = deg A — deg B — 1 and T = BA,. 
The closed-loop characteristic polynomial is A, = A,A,,, and the Diophantine 
equation in Step 2 becomes 


AR + BS = A, = A,An 


Examples 


The model-following design is illustrated by three examples 


EXAMPLE 3.1 Model-following with zero cancellation 
Consider a continuous-time process described by the transfer function 


1 
CO) = seat 


This can be regarded as a normalized model for a motor. The pulse transfer 
operator for the sampling period k = 0.5 s is 


foe ee ee 
UA) @+ag+a, g —1.6065q + 0.6065 


We have deg A = 2 and deg B = 1. The design procedure thus gives a first-order 
controller, and the closed-loop system will be of third order. The sampled data 
system has a zero in —0.84 and poles in 1 and 0.61. Let the desired closed-loop 
system be 


(3.16) 


(8.17) 


Bn (Q) _ bmod B 0.1761q 
AmQ) +Hümgtam 9g? — 1.8205q + 0.4966 


(3.18) 


This corresponds to a natural frequency of 1 rad/s and a relative damping 
of 0.7. Parameter bmo is chosen so that the static gain is unity. This model 
satisfies the compatibility conditions because it has the same pole excess as 
the process and the process zero is stable although poorly damped. To apply 
the design procedure in Algorithm 3.1, we first factor the polynomial B, and 


we obtain 

B*(q) = q + bi/bo 

B-(q) = bo 

Ban (g) = bmog/ bo 
Since the process is of second order, the polynomials R, S, and T will all be of 
first order. Polynomial A” is thus of degree zero. Since the polynomial is monic, 
we have R’ = 1. Since deg B+ = 1, it follows from the compatibility conditions 
that deg A, = 0. Choose 

A,(q) =1 
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The Diophantine equation (3.11) then becomes 
(g? + aig + ag) - 1+ bo(sog + 51) = QÊ + amig + dma 
Equating coefficients of equal power of q gives 


a) + bosa = Ami 
ag + bosi = Ong 


These equations can he solved if bp Æ 0. The solution is 


aa Qy,1 — Ay 
Pee Gm2 — ag 
1= E 
The controller is thus characterized by the polynomials 
b 
R(q) = Bt =q + > 
bo 
S(q)=soq +s 
= Eo bmog 
Tla) = AB, = -5 z 
0 


The process in Example 3.1 has a zero that is stable but poorly damped. 
The continuous-time equivalent corresponds to a zero with relative damping 
¢ = 0.06. We will therefore also determine a controller that does not cancel the 
zero, This is done in the next example. 


: EXAMPLE3.2 Model-following without zero cancellation 
Consider the same process as in Example 3.1, but use a control design in which 
there is no cancellation of the process zero. Since the process is of second order, 
the minimum-degree solution has polynomials R, S, and T of first order and the 
closed-loop system will be of third order. Since no zero is canceled, it follows 
from the compatibility condition in Algorithm 3.1 that deg Ap = 1. Since no 
process zeros are canceled, we have 
B+ =1 
B- = B = bog + bı 
It also follows from the compatibility conditions that the model must have the 
same zero as the process. The desired closed-loop transfer operator is thus 
bog + bi bmog + Omi 
H m — CS aM 
=p Q? + Omid + Ome P + nig + Ame 


where bmo = Bbo and 
bo + by 
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which gives unit steady state gain. The Diophantine equation (3.4) becomes 


(g +019 +az)(q +r1) + (bog + 41) (80g +81) = (Q +amıg +am2)(g +80) (3.19) 
Putting g = —b,/6o and solving for rı, we get 


_ bı , (bÌ — amibobi + amebg)(—b1 + dobo) 


rı = 
+” Bo bolb? — arbobi + agb) 


z aoüm2b0 + (A2 — mz — GoGmi)bobs + (ao + Ami — a1)b? 


b? — @)b9b) + azb% (3.20) 


Notice that the denominator is zero if polynomials A(q) and B(q) have a 
common factor. Equating coefficients of terms g? and g° in Eq. (8.19) gives 


oe by (aoam1 — 02 — Gm 101 + 0 + Am2 — 2104) 
b? — aybob; + azb? 
bo(Gm1a2 — 4142 — GoAme + a82) 
bi — &1bob1 + azb? 
_ bi(&102 — m1082 + tome — Goz) 
= b? — 416061 + azb? 
bofazäm2 — 43 — üolm201 + AoG20m1) 
bi — &ıbob1 + azb? 


+ 
(3.21) 


Furthermore, it follows from Eq. (3.12) that 


T(g) = BA.(q) = (g + G0) D 


Since the design method is purely algebraic, there is no difference between 
discrete-time systems and continuous-time systems. We illustrate this by an 
example. 


EXAMPLE 33 Continuous-time system 
The process discussed in Examples 3.1 and 3.2 has the transfer function 


_ b 
~ 8(s +a) 


G(s) 


with a = 1 and b = 1. The design procedure given by Algorithm 3.1 will now 
be used to find a continuous-time controller. Since the process is of second 
order, the closed-loop system will be of third order and the minimum-degree 
controller is of first order. Polynomial Am has degree two, Bm is a constant, 
and A, has degree one. We choose 


A,{s) =s +a 
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and let the desired response be specified by the transfer function. 


Bn{s) _ w? 
An(s) s?+2€@s+ 0? 


The Diophantine equation (3.4) becomes 
s(s + a)(s +11) + b(sos + 81) = (3 + 26 @s + @7)(s + a) 
Equating coefficients of equal powers of s gives the equations 
a+r, = 200+, 


arı + bso = O° + 20 wa, 


bs} = ao 
If b Æ 0, these equations can be solved, and we get 


rı = 20 + ao— a 
a2 @ + 0? — arı 
So = — = 


Furthermore, we have Bt = 1, B- = b, and B}, = œ?/b. It then follows from 
Eq. (3.12) that 


T(s) = Bi (s)Ao(s) = Č (e+ a) : 


An Interpretation of Polynomial A, 


It is possible to give an interpretation of the polynomial A, that appears in 
the minimum-degree pole placement solution in the case in which no process 
zeros are canceled. To do this, we observe that the pole placement problem can 
also be solved with state feedback and an observer. The closed-loop dynamics 
are then composed of two parts: one that corresponds to the state feedback and 
another that corresponds to the observer dynamics. For a system of degree n 
it is also known that it is sufficient to use an observer of degree n — 1. When 
no process zeros are canceled, the closed-loop characteristic polynomial in our 
case is A,,A,, where Am is of degree n and A, is of degree n — 1. By this 
analogy we can interpret the polynomial A,, as being associated with the state 
feedback and A, as being associated with the observer. We will therefore call 
A, the observer polynomial. In a system with state feedback it is also natural 
to introduce the command signals in such a way that they do not generate 
observer errors. This means that the observer polynomial is canceled in the 
transfer function from command signal to process output. 
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Relations to Model-Following 


Many other design methods can be related to pole placement. We will now 
show that pole placement can be interpreted as a model-following design. This 
is of interest because much work on MRAS is formulated in terms of model- 
following. Model-following generally means that the response of a closed-loop 
system to command signals is specified by a given model.. This means that 
both poles and zeros of the model are specified by the user. Pole placement, 
on the other hand, specifies only the closed-loop poles. In the minimum-degree 
pole placement procedure we did, however, introduce some auxiliary conditions 
that included the process zeros. We will now show that the control law given 
by Eq. (3.2) can be interpreted as model-following. It follows from Eqs. (3.11) 
and (3.12) that 

T AB, SBm 

R BA, RAp, 


The control law of Eq. (3.2) can be written as 


Re R!” BA," RA, R 
AB,, 


5 
= BA,” - POT In) 


A block diagram representation of this controller is given in Fig. 3.3. The figure 
shows that the controller can be interpreted as a combination of a feedforward 
controller and a feedback controlier. The feedforward controller attempts to 
cancel the plant dynamics and replace it with the response of the model B,, /Am. 
Also the feedback attempts to make the output follow this model. It is thus 
clear that the control law (3.2) can indeed be interpreted as a model-following 
algorithm. 


Figure 3.3 Alternative representation of model-following based on output 
feedback. 
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Notice that Fig. 3.3 is useful for the purpose of giving insight but that 
the controller cannot be implemented as shown in the figure because the 
inverse process mode] A/B is generally not realizable. Furthermore, A/B 
will be unstable if the system is non-minimum phase. However, the cascade 
combination of the reference model and the inverse process model is realizable 
if the mode]-following problem is well posed, that is, if Eqs. (3.13) are satisfied. 
Notice that the reference model and the inverse process model can be nonlinear 
without causing any stability problems because they appear only as part of a 
feedforward compensator. 


Summary 


In this section we have presented a straightforward design procedure that is 
relatively easy to use. The key problem in applying pole placement is to choose 
the desired closed-loop poles and the desired response to command signals. 
The choice is easy for low-order systems, but it may be difficult for systems of 
high order when many poles must be specified. Bad choices may result in a 
closed-loop system with poor sensitivity. In later chapters we will discuss this 
problem in more detail. 

In the sampled-data case the sampling interval is a crucial design param- 
eter. It is important to choose the sampling interval in relation to the desired 
closed-loop poles. 


3.3 INDIRECT SELF-TUNING REGULATORS 


Methods for estimating parameters of the model given by Eq. (3.1) were pre- 
sented in Chapter 2. These methods will now be combined with the design 
method of Section 3.2 to obtain a simple self-tuning regulator. For simplicity 
it will be assumed that the disturbance v in Eq. (3.1) is zero. 


Estimation 
Several of the recursive estimation methods outlined in Chapter 2 can be used 
to estimate the coefficients of the A and B polynomials. The equations for 
recursive least-squares estimation will be used. The process model (3,1) can be 
written explicitly as 

y(t) = — a(t — 1) — ayt — 2) —...— anyit — n) 

+ bou(t — do) +... + bmult — do — m) 

Notice that the degree of the system is max(n, dọ + m). The model is linear in 
the parameters and can be written as 


y(t) = 9" (t—1)8 
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where 
OB’ =(a, dz ... an bo ..- bm) 
g7(t-1)=(-v(@- 1)... —y(t—n) u(t—do) ... u(t—do—m)) 
The least-squares estimator with exponential forgetting is given by 
A(t) = A(t -1) + K (Hele) | 
e(t) = y(t) — 97 (t— 1)Ê(t — 1) 
K(t) = P(t- Ijot- 1) (4+7 (t-1)PE-— Dolt- 1) 
P(t) = (1- Kp" (t- 1) PE- 1)/å 


(3.22) 


(Compare with Eq. (2.21).) If the input signal to the process is sufficiently 
exciting and the structure of the estimated model is compatible with the pro- 
cess, the estimates will converge to their true values. It takes max(n,m + do) 
sampling periods before the regression vector is defined. In the determinis- 
tic case it takes at least n + m +1 additional sampling periods to determine 
the n+ m + 1 parameters of the model, assuming that the process input is 
persistently exciting. It thus takes at least 


N=n+m+1+4max(n,m 4+ do) (3.23) 


sampling periods for the algorithm to converge. With recursive least squares 
initialized with a large P-matrix it may take a few more steps. Since the process 
input is generated by feedback, it may be difficult to assert that it is persistently 
exciting. Presence of process noise may also make convergence much slower. 
Convergence issues will be discussed further in Chapter 6. 


An Indirect Self-Tuner 


Combining the recursive least squares (RLS) estimator given by Eqs. (3.22) 
with the minimum-degree pole placement method (MDPP) for controller design 
given by Algorithm 3.1, we obtain the following self-tuning regulator. 


ALGORITHM 3.2 Indirect self-tuning regulator using RLS and MDPP 


Data: Given specifications in the form of a desired closed-loop pulse transfer 
operator Bm/Am and a desired observer polynomial Ao. 


Step1: Estimate the coefficients of the polynomials A and B in Eq. (3.1) using 
the recursive least-squares method given by Eqs. (8.22). 


Step 2: Apply the minimum-degree pole placement method given by Algo- 
rithm 3.1 where polynomials A and B are the estimates obtained in Step 1. 
The polynomials R, S, and T of the control law are then obtained. 
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Step 3: Calculate the control variable from Eq. (3.2), that is, 
Ru(t) = Twe(t) — Sy(2) 


Repeat Steps 1, 2, and 3 at each sampling period. Notice that there are some 
variations in the algorithm depending on the cancellations of the process zeros. 
Also notice that it is not necessary to perform Steps 1 and 2 at each sampling 
interval. oO 


Examples 


The properties of indirect self-tuning regulators are illustrated by the following 
two examples. 


EXAMPLE 3.4 Indirect self-tuner with cancellation of process zero 
Let the process be the same as in Example 3.1 and assume that the process 
zero is canceled. The specifications are the same as in Example 3.1, that is, 
to obtain a closed-loop characteristic polynomial Am. The parameters of the 
model 

y(t) + ary(t — 1) + aox(t — 2) = bou(t — 1) + byu(t — 2) 
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Figure 3.4 Output and input in using an indirect self-tuning regulator to 
control the system in Example 3.1. Notice the “ringing” in the control signal 
due to cancellation of the zero at —0.84. 
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Time 
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Time 
Figure 3.5 Parameter estimates corresponding to the simulation in Fig. 3.4. 
The true parameters are shown by dashed lines. 


which has the same structure as Eq. (3.17), are estimated by using the least- 
squares algorithm. Algorithm 3.2 is used for the self-tuning regulator. The 
calculations, which were done in Example 3.1, give the control law 


u{é) + riu(t — 1) = tou,(t) — soy(é} — siy(é — 1) 


The controller parameters were expressed as functions of the model parameters 
and the specifications. Figure 3.4 shows the process output and the control 
signal in a simulation of the process with the self-tuner when the command 
signal is a square wave. The output converges to the model output after an 
initial transient, The control signal has a severe oscillation (“ringing”) with a 
period of two sampling periods. This is due to the cancellation of the process 
zero at z = —bı/bo = —0.84. This oscillation is a consequence of a bad choice 
of the underlying design methodology. The initial transient depends critically 
on the initial values of the estimator. In this particular case these values were 
&;(0) = &2(0) = 0, 6o(0) = 0.01, and 6,(0) = 0.2. Notice that it is necessary 
that bo # 0. (Compare with Example 3.1.) The initial covariance matrix was 
diagonal with P(1,1) = P(2,2) = 100 and P(3,3) = P(4,4) = 1. The reason for 
using different values for parameters 4; and 6; is that these parameters differ 
by an order of magnitude. 

The parameter estimates are shown in Fig. 3.5. The behavior of the esti- 
mates depends critically on the initial values of the estimator. Notice that the 
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estimates converge quickly. They are close to their correct values already at 
time t = 5. The estimates obtained at time t = 100 are 

&,(100) = —1.60 (—1.6065) bo(100) = 0.107 (0.1065) 

42(100) = 0.60 (0.6065) (100) = 0.092 (0.0902) 


These values are quite close to the true values, which are given in parentheses. 
The controller parameters obtained at time ¢ = 100 are 


r1(100) = 0.85 (0.8467) to(100) = 1.65 (1.6531) 
so(100) = 2.64 (2.6852) s(100) = —0.99 (—1.0821) 


The system in Example 3.4 behaves quite well, apart from the “ringing” 
control signal. This can be avoided by using a design in which the process zero 
is not canceled. The consequences of this are illustrated in the next example. 


i 


i; EXAMPLE 3.5 Indirect self-tuner without cancellation of process zero 


Consider the same process as in Example 3.4, but use a control design in 
which there is no cancellation of the process zero. The parameters are estimated 
in the same way as in Example 3.4, but the control law is now computed as in 
Example 3.2. Polynomial A, is of first order. As in the previous examples the 
initial transient depends critically on the initial state of the recursive estimator. 


0 y 
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Figure 3.6 Same as in Fig, 3.4 but without cancellation of the process zero. 
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Figure 8.7 Parameter estimates corresponding to the simulation in Fig. 3.6. 
The true parameter values are indicated by dashed lines. 


For the design calculation it must be required that initial values are chosen so 
that polynomials A and B do not have a common factor. In this case the initial 
estimates were chosen to be 4;(0) = &2(0) = 0, 60(0) = 0.01, and 6,(0) = 0.2. 
The P-matrix was initialized as a diagonal matrix with P(1,1) = P(2,2) = 100 
and P(3,3) = P(4,4) = 1 as in Example 3.4. Figure 3.6 shows results of a 
simulation of the direct algorithm with a, = 0. Notice that the behavior of the 
process output is quite similar to that in Fig. 3.4 but that there is no “ringing” 
in the control signal. The parameter estimates are shown in Fig. 3.7. The values 
obtained at time t = 100 are 


â (100) = —1.57 (—1.6065) &o(100) = 0.092 (0.1065) 
(100) = 0.57 (0.6065) &,(100) = 0.112 (0.0902) 


The true values are given in parentheses. The controller parameters at time 
t = 100 are 


rı(100) = 0.114 (0.1111) fg(100) = 0.86 (0.8951) 
so(100) = 1.44 (1.6422) sı(100) = —0.58 (—0.7471) 


A comparison of Fig. 3.5 and Fig. 3.7 shows that it takes significantly longer for 
the estimates to converge when no zero is canceled. The reason for this is that 
the excitation is not as good as when there was “ringing” in the control signal. 
There is very little excitation of the system in the periods when the output 
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and the control signals are constant. This explains the steplike behavior of the 
estimates. 

It may seem surprising that the controller already gives the correct steady- 
state value at time ¢ = 20 when the parameter estimates differ so much from 
their correct values. The controller parameters are 


rı(20) = 0.090 (0.1111) to(20) = 0.88 (0.8951) 
so(20) = 1.13 (1.6422)  sı(20) = ~0.29 (—0.7471) 


Since the process has integral action, we have A(1)} = 0. It then follows from 
Eq. (3.3) that the static gain from command signal to output is 


BOTU) _ PQ) 
A(1)R(1) + BQ)S(Q1) — S(1) 
To obtain the correct steady-state value, it is thus sufficient that the controller 
parameters are such that S(1) = T(1), which in the special case is the same 
as tg = so + s1. When no poles are canceled, it follows from Eq. (8.12) that 
An(1) 


T0) = Ao(1)By(1) = AD Ee 


where B is the estimated B polynomial. Hence 


TQ) _ Ao(1An(l) _ 
S(1) B(1)S(1) 


where the last equality follows from Eq. (3.11). Notice that we have A(1) = 0. 
We thus obtain the rather surprising conclusion that the adaptive controller in 
this case will automatically have parameters such that there will be no steady- 
state error. o 


These examples indicate that the indirect self-tuning algorithm behaves as 
can be expected and that the estimate of convergence time given by Eq. (3.23) 
is reasonable. The examples also show the importance of using a good un- 
derlying control design. With model-following design it is recommended that 
cancellation of process zeros is avoided. 


Summary 


The indirect self-tuning regulator based on model-following given by Algo- 
rithm 3.1 is a straightforward application of the idea of self-tuning. The adap- 
tive controller has states that correspond to the parameter estimate 6, the 
covariance matrix P, the regression vector o, and the states required for the 
implementation of the control law. The controller in Example 3.4 has 20 state 
variables; updating of the covariance matrix P alone requires ten states. The 
complete codes for the controllers in the examples are listed in the problems 
at the end of this chapter. 
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The algorithm can be generalized in many different ways by choosing other 
recursive estimation methods and other control design techniques. The idea is 
easy to apply. A detailed discussion of practical implementation is given in 
Chapter 11. 


3.4 CONTINUOUS-TIME SELF-TUNERS 


Continuous-time self-tuners can be derived in the same way as discrete-time 
self-tuners. To show this, consider a system that can be described by the model 
(3.1) with v = 0, that is, 


A(p) y(t) = B(p)u(t) 
where A(p) and B(p) are polynomials in the differential operator, p = d/dt: 


A(p) = po +p +--+ an 

B(p) = bip +--- + by, 
A self-tuning regulator can be obtained by applying Algorithm 3.1. The only 
complication is that we now must apply recursive least-squares estimation to 
the continuous-time model. This was discussed in Section 2.3. Let us recall the 
key idea. Since it is undesirable to take derivatives, a stable filtering transfer 


function Hp with a pole excess of n or more is introduced. 
If we introduce the filtered signals 


yet) = Hy) urli) = Ayu) 


the model (3.1) can be written as 


p"y,(t) = 97 (10 
where 
plt) = (pys =- yp ph lug = up) 
O=(a, = an by = ba) 


Ey using least squares with exponential forgetting the parameter estimate is 
then obtained from Theorem 2.5: 


IO L Pol (AD -oO 


ZO = aP(t) — P(t)olt)o" (EPU) 


We illustrate the procedure given by Algorithm 3.1 by an example. 
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`- EXAMPLE3.6 Continuous-time self-tuner 


Consider the system in Example 3.3, in which the process has the transfer 


function 
b 


s(s +a) 
with a = 1 and b = 1. Notice that the process has only two unknown parame- 
ters, a and b. The regressor filters in the estimator are chosen to be 


G(s) = 


1 
H;(s) = ——~ 
f ( ) An(s) 
Furthermore, we use an estimator without forgetting, that is, a = 0. Assume 
that it is desired to obtain a closed-loop system with the transfer function 


m? 


Gn(s) = s? + 20 @s + a? 


The observer polynomial is chosen to be A,(s) = s +a, with a, = 2. The 
specifications are the same as in Example 3.4, that is, ¢ = 0.7 and w = 1. In 
Example 3.3 we solved the design problem when the parameters a and b are 
known. We found that the controller has the form 


Sop + $1 tolp + ao) 
u(t) = = ylti) + — ——— Helt 
() =- + E uef) 
1 
0 le y 
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Figure 3.8 Output and input when using a continuous-time indirect self 
tuning regulator to control the process in Example 3.6, 


3.4 Continuous-Time Self-tuners 111 


0 5 10 15 20 
Time 


Figure 3.9 Continuous-time parameter estimates corresponding to the sim- 
ulation in Fig. 3.8. The lower part shows the estimates in an extended time 
scale. 


where the controller parameters are given by 


ry =2f0+a,-—a 
G26 @ + 0 — ary 


So b 
oa, 
$1 = b 
w? 
n= 


Figure 3.8 shows the process output and the control signal in a simulation. 
The initial transient depends critically on the initial values of the estimator. 
In this case we have chosen 4(0} = 2 and 6(0) = 0.2. The initial covariance is 
diagonal with P(1,1) = P(2,2) = 100. The parameter estimates are shown in 
Fig. 3.9. The estimates obtained at £ = 100 are 


â(100) = 1.004 (1.0000) (100) = 1.001 (1.0000) 
where the true values are given in parentheses. Notice that only two param- 


eters are estimated in this case, whereas four parameters were estimated in 
Examples 3.4 and 3.5. m 


112 Chapter 3 Deterministic Self-tuning Regulators 


3.55 DIRECT SELF-TUNING REGULATORS 


The design calculations in the indirect self-tuners may be time-consuming and 
poorly conditioned for some parameter values. It is possible to derive other 
algorithms in which the design calculations are simplified or even eliminated. 
The idea is to use the design equations to reparameterize the model in terms 
of the parameters of the controller. This reparameterization is also the key 
to understanding the relations between model-reference adaptive systems and 
self-tuning regulators. 
Consider a process described by Eq. (3.1) with v = 0, that is, 


Ay(t) = Bu(e) 
and let the desired response be given by Eq. (3.5): 
Anymn(t) = Br, (t) 


The process model will now be reparameterized in terms of the controller 
parameters. To do this, consider the Diophantine equation (3.11), 


A.A, = AR’ + BS 
as an operator identity, and let it operate on y(t). This gives 
A,Amy(t) = R’ Ay(t} + B-Sy(t) = R'Bu(t) + BSy(t) 
It follows from Eq. (3.10) that 
R'B = R' B+B- = RB” 
Hence 
AcAmy(t) = B7 (Ru(t) + Sy(t)) (3.24) 


Notice that this equation can be considered a process model that is parameter- 
ized in the coefficients of the polynomials B-, R, and S. If the parameters in 
the model given by Eq. (3.24) are estimated, the control law is thus obtained 
directly without any design calculations. Notice that the model Eq. (3.24) is 
nonlinear in the parameters because the right-hand side is multiplied by B-. 
The difficulties caused by this can be avoided in the special case of minimum- 
phase systems in which BT = bo, which is a constant. 


Minimum-Phase Systems 


If the process dynamics is minimum phase, we have deg A, = deg A—deg B —1, 
B- is simply a constant, and Eq. (8.24} becomes 


AmAoy(t) = bo (Ru(t) + Sy(t)) = Ru(é) + $y(t) (3.25) 
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where R is monic, R = boR, and S = oS. Since R and Š differ only by R 
being monic, we will not use a separate notation in the following discussion. 
When it is necessary, we will simply note whether or not R is monic. 
When all process zeros are canceled, it is also natural to choose specifica- 
tions so that 
Bm = 07” Am(1) 


where dy = deg A — deg B. This gives response with minimal delay and unit 
static gain. 
By introducing the parameter vector 


G=(ro ... Te So ... Se) 


and the regression vector 


p(t) = (ult) u- y(t)... ylt-9) 
the model given by Eq. (3.25) can be written as , 


n(é) = A; (q) At, (a) yE) = g7 (E — oO (3.26) 


Since 7(¢) can be computed from y(t), it can be regarded as an auxiliary output, 
and a recursive estimate of the parameters can now be obtained as described 
in Chapter 2. 

This estimation method works very well if there is little noise, but the 
operation A*(q~!)A*,(¢-")y(t) may amplify noise significantly. The following 
method can be used to overcome this. Rewrite Eq. (3.25) as 


Ht) = 4 4 (Rult) +S) = Rup- do) +S*y/(t—do) (8.27) 
where 1 
(ORDA) O 3.29 
= RED" 


and dọ = deg A — deg B. We have further assumed that deg R = degS = 
deg(A,Ap) — do = £. Equation (3.27) can be used for least-squares estimation. 
If we introduce 
O= (ro ... Te So .-- se} 
and 
p(t) = (ust) ... uslt—2) yet)... ye(t—4)) 
it can be written as 
y(t) = v7 (t — do) 0 


The estimates are then obtained recursively from Eqs. (3.22). The following 
adaptive control algorithm is then obtained. 
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ALGORITHM 3.3 Simple direct self-tuner 


Data: Given specifications in terms of Am, Bm, and A, and the relative 
degree do of the system. 


Step 1: Estimate the coefficients of the polynomials R and S in the model 
(3.27), that is, 
y(t) = Bru s(t — do) + S*y¢(t — do) 


by recursive least squares, Eqs. (3.22). 


Step 2: Compute the control signal from 
R’u(t) = T*u,(t) — S*y(t) 
where R and S are obtained from the estimates in Step 1 and 


T* = A*A,, (1) (3.29) 
with deg A, = dy — 1. Repeat Steps 1 and 2 at each sampling period. O 


Equation (3.29) is obtained from the observation that the closed-loop trans- 
fer operator from command signal u, to process output is 


TB _ TbBt =i 
AR+BS ~~ boA,AnBt A Am 


Requiring that this be equal to g% Am(1)/Am gives T = g”-“A,,(1)A, which 
implies Eq. (3.29). 

Remark 1. A comparison with Algorithm 3.2 shows that the step correspond- 
ing to control design is missing in Algorithm 3.3. This motivates the name 
“direct algorithm.” 

Remark 2. Notice that it is necessary to know the relative degree do of the 
plant @ priori. 

Remark 3. The polynomials R and 5 contain the factor bo. Notice that the 
polynomial R is not monic and that the parameter ro must be different from 
zero. Otherwise, the control law given by Eq. (3.2) is not causal. Since dp 
is the relative degree of the plant, the true value of ro = bo is different from 
zero. Any consistent estimate of the parameter will thus be different from zero. 
The estimate obtained for finite time may, however, be zero. In practice it is 
therefore essential to take some precautions. 


Remark 4. Notice that the assumption B7 = bp implies that all process zeros 
are canceled. This is the reason why the algorithm requires the plant to be 
minimum phase. 


Examples 


Properties of direct. self-tuners will now be illustrated by some examples. 
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EXAMPLE3.7 Direct self-tuner with dọ = 1 
Consider the system in Example 3.1. Since degA = 2 and deg B = 1, 
we have deg Am = 2 and degA, = 0. Hence A, = 1, and we will choose 
Bm = gA,,(1). Equation (3.29) in Algorithm 3.3 then gives T = gA,,(1). The 
controller structure is given by deg R = degS = deg T = deg A — 1 = 1. The 
model given by Eq. (3.27) therefore becomes 


y(t) = rows(¢ — 1) + riuplt — 2) + soye(t — 1) + siyz(t — 2) (3.30) 


where 
ý up (t) + amilt — 1) + amur(t— 2) = u(t) 
yp (t) + Omiys(t — 1) + Omiys (tf — 2) = y(t) 


It is now straightforward to obtain a direct selftuner by applying Algo- 
rithm 3.3. The parameters of the model given by Eq. (3.30) are thus estimated, 
and the control signal is then computed from 


Fou(t) + Pyu(t — 1) = tou,(t) — Soyle) — Sylt — 1) 


where Fo, #1, ŝo, and 8; are the estimates obtained and % is given by Eq. (3.29), 
that is, 
fo = 1 4 Omi + m2 
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Figure 3.10 Command signal u., process output y, and control signal u 
when the process given by Eq. (3.16) is controlled by using a direct self-tuner 
with do = 1. Compare with Fig. 3.4. 
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Figure 3.11 Parameter estimates corresponding to the simulation shown 
in Fig. 3.10: ĉi /fo (solid line), f/f, (dashed line), ŝ0/fo (dash-dot lne), 3,/7p 
(dotted line). 


Notice that the estimate of ro must be different from zero for the controller to 
be causal. 

Figure 3.10 shows the process inputs and outputs in a simulation of the 
direct algorithm, and Fig. 3.11 shows the parameter estimates. The initial 
transient depends strongly on the initial conditions. At t = 100 the controller 
parameters are 


aaa: = 0.860 (0.8467) ha =1.65 (1.6531) 

$ 0 

&o(100} _ (100) _ 

Fo(100) 7268 (2.6852) (100) = 193 (—1.0321) 


The controller parameters are divided by fp to make a direct comparison with 
Examples 3.1 and 3.3. The correct values are given in parentheses. A compar- 
ison of Fig. 3.4 and Fig. 3.10 shows that the direct and indirect algorithms 
have very similar behavior. The limiting contro] law is the same in both cases. 
There is “ringing” in the control signal because of the cancellation of the pro- 
cess zero. E 


In a practical case the time delay and the order of the process that we 
would like to control are not known. It is therefore natural to consider these 
variables as design parameters that are chosen by the user. The parameter do 
is of particular importance for a direct algorithm. In the next example we show 
that “ringing” can be avoided simply by increasing the value of dp. 


* EXAMPLE3.8 Direct self-tuner with do = 2 


In the derivation of the direct algorithm the parameter dp was the pole excess 
of the plant. Assume for a moment that we do not know the value of do and that 
we treat it as a design parameter instead. Figure 3.12 shows a simulation of 
the direct algorithm used in Example 3.7 but with dọ = 2 instead of dp) = 1. All 
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Figure 3.12 Command signal ue, process output y, and control signal u 
when the process described by Eq. (3.16) is controlled with a direct self-tuner 
with dy = 2. 


the other parameters are the same. Notice that the behavior of the system is 
quite reasonable without any “ringing” in the control signal. Figure 3.13 shows 
the parameter estimates. The estimates obtained at time ¢ = 100 correspond 
to the controller parameters 


f1(100) __g.gq7 COO iog END y, ĉo(100) _ 50 
ĵo (100) fo(100) fo(100) Fo{100) 
$1/Fo 
15 20 
Time 


Figure 3.13 Parameter estimates corresponding to Fig. 3.12: 7, /> (solid 
line), fo/7> (dashed line), ŝ0/fo (dash-dot line), 1/7) (dotted line). 
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We thus find the interesting and surprising result that cancellation of the 
process zero can be avoided by increasing the parameter do. This observation 
will be explained later when we will be analyzing the algorithms. o 


Feedforward Control 


A nice feature of the direct self-tuner is that it is easy to include feedforward. 
Let v be a disturbance that can be measured. By estimating parameters in the 


model 1 
y(i) = LA, (Ru(t) + Sy(t}— Uv(t)) (3.31) 


and using the control law 
Ru(t) = Tu,(t) — Sy(t) — Uv(t) 


we obtain a self-tuning controller that combines feedback and feedforward. The 
term Tue in the control law can also be viewed as a feedforward term. 

In Algorithm 3.3, polynomials R and S are estimated and the polynomial T 
is computed. This means that the different terms of the control law are treated 
differently. It is possible to obtain an algorithm in which all coefficients of the 
control law are estimated by treating Tu, as a feedforward term that is adapted. 
To do this, we first notice that the desired response is given by 


ynl) = Fe welt) = Za tet) 
It follows from Eq. (3.27) that error e(t) = y(t) — ym(t) is given by 


1 
e(t) = ga (Rule) + So(t) ~ Tue() 
= R'u s(t — do) + S*y¢(t — do) — T'uep(t — do) (3.32) 
where up, yf, and uep are the filtered signals defined by Eqs. (3.28) and 


1 
MO Bay “0 


Furthermore, deg T = deg R = deg S = deg(A,A,,}—do and deg Am —deg Bm = 
do. An algorithm that is analogous to Algorithm 3.3, in which the parameters 
of the feedforward polynomial T are also estimated is now easily obtained by 
estimating the parameters in Kq. (3.32). 


Non-minimum-Phase (NMP) Systems 


The case in which process zeros cannot be canceled will now be discussed. 
Consider the transformed process model Eq. (3.24), that is, 


AvAmy(t) = B~ (Ruft) + Sy) 
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where deg R = deg S = deg(A,A,,) — deg B~. If we introduce 
R= BR and S =BS 


the equation can be written as 


y(t) = aa (Ru(t) + Sy(t)) = R'up(t — do) + S*yp(t— do) (3-33) 


where uç and yç are the filtered inputs and outputs given by Eqs. (3.28). 
Notice that the polynomial R is not monic. The polynomials ® and S have 
a common factor, which represents poorly damped zeros. This factor should 
be canceled before the control law is calculated. The following direct adaptive 
control algorithm is then obtained. 


ALGORITHM 3.4 Direct self-tuning regulator for NMP systems 


Data: Given specifications in terms of Am, Bm, and A, and the relative degree 
dy of the system. 


Step 1: Estimate the coefficients of the polynomials R and S$ in the model of 
Eq. (3.33) by recursive least squares. 


Step 2: Cancel possible common factors in Rand S to obtain R and S. 


Step 3: Calculate the control signal from Eq. (8.2) where F and S are those 
obtained in Step 2 and T is given by Eq. (3.12). 
Repeat Steps 1, 2, and 3 at each sampling period. o 


This algorithm avoids the nonlinear estimation problem, but more pararme- 
ters have to be estimated than when Eq. (3.24) is used because the parameters 
of the polynomial B- are estimated twice. The estimation is straightforward, 
however, because the model is linear in the parameters. The Euclidean algo- 
rithm in Chapter 11 can be used in Step 2 to eliminate common factors of 
polynomials R and S. This step is crucial because an unstable common factor 
may cause instabilities. 

Calculation of polynomial 7 should be avoided. To do this, notice that 


B-B; 
Ym = An 
The error e = y — Ym can then be written as 
B- 
e(t) = 4 g (Ru(t) + Sylt) — Tue (t) 
= Rup(t — do) + S*yr(t — do) — Tuer(t — do) (3.34) 


By basing parameter estimation on this equation, estimates of polynomials &, 
S, and T can be determined. Notice that to estimate coefficients of T, it is 
necessary that the command signal be persistently exciting. 
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Mixed Direct and Indirect Algorithms 


Another direct algorithm can be derived in the particular case in which no 
process zeros are canceled. In this case we have B = B, and the model 
Eq. (3.24) becomes 


A,Any(t) = B(Ru(t) + Sy(t)) 
which can also be written as 


y(t) = q (Ru(t) + Sy(0)) = BY (R'up(t—do) + Sty¢(t—do)) (8.35) 


The following algorithm is a hybrid algorithm that combines features of direct 
and indirect schemes. 


ALGORITHM 35 A hybrid self-tuner 
Data: Given polynomials A, and Am. 
Step 1: Estimate parameters of polynomials A and B in the model 


Ay = Bu 


Step 2: Estimate parameters of polynomials F and S in Eq. (3.35) where B 
is the estimate obtained in Step 1. 


Step 3: Use the contro] law 
Ru = Tu, — Sy 
where R and S are obtained from Step 2 and T = toA, where 
B(1) 


o 
Remark 1. Instead of being computed, polynomial T can also be estimated by 
replacing Step 2 by the folowing step: 


Step 2 Estimate parameters of polynomials R and S and to from the model 


B 
e(t) = y(t) — Ym (t) = ZA (Rutt) + Sy) — toActe(?)) 
= B* (R'u p(t — do) + S*y¢(t — do) — toAztucs(t — do)) (3.36) 
where B is the polynomial obtained in Step 1. It is then assumed that deg A,, = 
deg A. 


Remark 2. Instead of the Diophantine equation being solved at each step, two 
process models are estimated. This implies that an additional iteration of the 
least-squares estimator has to be done at each sampling time. 
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3.6 DISTURBANCES WITH KNOWN CHARACTERISTICS 


So far, we have concentrated on servo problems that are common in aerospace 
and mechatronics. In process control, regulation problems are more common. 
It is then important to consider attenuation of disturbances that act on the 
process. The disturbance may enter the process in many different ways. For 
simplicity we will assume that it enters at the process input as shown in 
Figure 3.2. This assumption is not very restrictive. If the disturbance is denoted 
by v, the system is then described by Eq. (3.1). We will first use an example to 
illustrate that load disturbances will cause problems. 


EXAMPLE 3.9 Effect of Load Disturbances 


Consider the system in Example 3.5, that is, an indirect self-tuning regulator 
with no zero cancellation. We will now make a simulation that is identical to 
the one shown in Fig. 3.6 except that the load disturbance will be u(t) = 0.5 for 
t > 40. A forgetting factor A = 0.98 has also been introduced; otherwise, the 
conditions are identical to those in Example 3.5. The behavior of the system 
is shown in Fig, 3.14. Compare Fig. 3.14 with Fig. 3.6. Figure 3.14 shows that 
a load disturbance may be disastrous. It follows from the discussion in Exam- 
ple 3.5 that the correct steady-state value will always be reached provided that 
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Figure 3.14 Output and control signal when for a system with an indirect 
self-tuner without zero canceling when there is a load disturbance in the form 
of a step at the process input at time £ = 40. 
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Figure 3.15 Parameter estimates corresponding to Fig. 3.14. 


the steps are sufficiently long. Notice that the response is strongly asymmetric. 
The reason for this is that the controller parameters change rapidly when the 
control signal changes; see Fig. 3.15, which shows the parameter estimates. 
Rapid changes of the estimates in response to command signals indicates that 
the model structure is not correct. The parameter estimates also change sig- 
nificantly at the step in the load disturbance. When the command signal is 
constant, the parameters appear to settle at constant values that are far from 
the true parameters. m 


There are many ways to deal with disturbances. The internal model prin- 
ciple is used in this section. An alternative is to estimate the disturbance and 
compensate for it in a feedforward fashion. An in-depth discussion of different 
methods and their advantages and disadvantages is found in Chapter 11. 


A Modified Design Procedure 


The pole placement procedure can be modified to take disturbances into ac- 
count, In many cases the important disturbances have known characteristics. 
This can be captured by assuming that the disturbance v in the model (3.1) is 
generated by the dynamical system 


Agu=e (3.37) 
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where e is a pulse, a set of widely spread pulses, white noise, or the equivalent 
continuous-time concepts. For example, a step disturbance is generated in 
discrete-time systems by 


Aa(q)=q—-1 
and in continuous-time systems by 
Aa(p) =p 
With the controller of Eq. (3.2) we find 


_ BT, BR , 
V= AR+ BS“ (AR + BS) 


_ AT BS, 
 AR+BS © Aa(AR + BS) 


(3.38) 


u 


The closed-loop characteristic polynomial thus contains the disturbance dynam- 
ics as a factor. This polynomial typically has roots on the stability boundary 
or in the unstable region. It follows from Eqs. (3.38) that to maintain a finite 
output in case of these disturbances, Ag must be a factor of R. This would 
make y finite, but the controlled input u may be infinite. This is, of course, 
necessary to compensate for an infinite disturbance. 

It has already been mentioned that the Diophantine equation has many 
solutions. Compare with Eqs. (3.14). If R” and S° are solutions to the Diophan- 
tine equation 

AR” + BS? = A? 


it follows that 
R=XR°+YB 


3.39 
S=XS°-YA (3-39) 


satisfies the equation 
AR + BS=XA? 


If a controller R° S° that gives the characteristic polynomial A? has been 
obtained, we can thus obtain a controller with characteristic polynomial X A9} 
by using the controller (3.39). Suppose that we have designed a controller R° 
and 8° and that we would like to have a new controller in which R = R’A,. We 
then choose a stable polynomial X that represents the additional closed-loop 
poles, and we determine Ff’ and Y such that 


R= AgR' =XR°+YB (3.40) 


The new controller is then given by Eqs. (3.39). 
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Integral Action 


In the special case in which the disturbance is a constant, that is, Ag = q — 1, 
we have to add an additional closed-loop pole. Hence 


xX =G+Xo 
and Eq. (3.40) becomes 
(q—1)R’ = (q + xo)R° + yoB 
Putting q = 1 gives one equation to solve for yo. Hence 


_ (+0) R°(1) 
GME o) ae 


Inserting X and Y = y into Eqs. (3.39) gives the new controller. 


(3.41) 


Modifications of the Estimator 


Disturbances will change the relations between the inputs and the outputs 
in the model. Load disturbances such as steps will have a particularly bad 
effect on the low-frequency properties of the model. Several ways to deal with 
this problem are discussed in Section 11.5. One possibility is to include the 
disturbance in the model and estimate it; another, which we will use here, is 
to filter the signal so that the effect of the disturbance is not so large. In the 
model given by Eq. (3.1) the equation error is B(q)v. This could be a very large 
quantity if B(1) 4 0 and v is a large step. If the disturbance v in Eq. (3.1) can 
be described by Eq. (8.37) we find that Eq. (3.1) can be written as 


AgAy(t) = AgB(u(t) + v(t) = AgBu(t) + e(t) 


Hence 
Ayp(t) = Buy(t) + e(t) (3.42) 


By introducing the filtered signals ys = Agy and us; = Agu we thus obtain a 
model in which the equation error is e instead of v, where e is significantly 
smaller than v. For example, if v is a step and Ag = q — 1 as in Example 3.9, 
we find that e is zero except at the time where the step in v occurs. 

The next example shows that the difficulties encountered in Example 3.9 
can be avoided by using a self-tuner with a modified estimator and a modified 
control design. 


’ EXAMPLE 3.10 Load disturbances: Modified estimator and controller 

We now show that the difficulties found in Example 3.9 can be avoided by 
modifying the estimator and the controller. We first introduce a controller 
that has integral action by applying the design procedure that we have just 
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Figure 3.16 Output and control signal with an indirect self-tuner with 
integral action and a modified estimator. 


described. To do this, we consider the same system as m Example 3.5 where 
the controller was defined by 


R°=qt+r, S°=s0q4+51 


The closed-loop characteristic polynomial A, has degree three. To obtain a 
controller with integral action, the order of the closed-loop system is increased 
by introducing an extra closed-loop pole at g = —xo = 0. It then follows from 
Eq. (3.41) that 

1+ry 

bo + bi 


Hence X = q and Y = yo, and Eqs. (3.39) now give 


R = qalq + rı) + yo(bog + bı) = (q — 1)(¢ — biyo) 
S = q(sog + $1) — yolg” + arg + a2) = (80 — yo)}g? + (81 — a1yo)q — a2y0 


The estimates are based on the model (3.42) with Ag = q — 1 to reduce the 
effects of the disturbances. Figure 3.16 shows a simulation corresponding to 
Fig. 3.14 with the modified self-tuning regulator. A comparison with Fig. 3.14 
shows a significant improvement. The load disturbance is reduced quickly. 
Because of the integral action the control will decrease with a magnitude 
corresponding to the load disturbance shortly after ¢ = 40. The parameter 
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Figure 3.17 Parameter estimates corresponding to Fig. 3.16. 


estimates are shown in Fig. 3.17, which indicates the advantages in using the 
modified estimator. Notice in particular that there is a very small change in 
the estimates when the load disturbance occurs. oO 


A Direct Self-tuner with Integral Action 


It is also straightforward to introduce integrators in the direct self-tuners. 
Consider a process model given by 


A(g)y(t) = B(g)(u@) + v(t) (3.43) 


where d = deg A(q) — deg B(q). It is assumed that v is constant or changes 
infrequently. Let the desired response to command signals be given by 


Am(Q)y(é) = Am(1)uc(t — d) (3.44) 


where deg Am > d. Let the observer polynomial be A,(q). The design equation 
is 
AR + BS = BtA,An (3.45) 


where B = b Bt. If we require that the regulator has integral action, we find 
that the polynomial R has the form 


R = R'Bt = RBt(q—1)=R,Bta (3.46) 
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Equation (3.45) then becomes 
AAR, + bos = Ap Am (3.47) 
Hence 
Ay Any = AR Ay + boSy 
= BR] Au + bK Av + bp Sy 
= bo(R'Au + Sy) + boR Av (3.48) 
where Eq. (3.43) was used to obtain the second equality. Notice that the last 


term will vanish after a transient if v is constant. If we rewrite Eq. (3.48) in 
the backwards operator, ignoring v, we get 


Ala) Anla ylit + d) = bo(R" (97) A* (Qu) + S*(Q) y(t) (8-49) 
This equation can be used as a basis for parameter estimation, but there are 
several drawbacks in doing so. First, the operation AjA*, is a high-pass filter 
that is very sensitive to noise. Furthermore, it follows from Eq. (3.47) that 

boS*(1) = A3(1)A% (1) = Ao(1)Am(1) (8.50) 


All the parameters in the S polynomial are thus not free. If all parameters are 
estimated, there is, of course, no guarantee that Eq. (3.50) holds. However, it 
is easy to find a remedy. A polynomial S* with the property given by Eq. (3.50) 
can be written as 
boS* = Ao(1)Am (1) + (1 — g’) S™ (a>) 
= Ao(1)Am(1) + S” (°A 
Equation (3.49) then becomes 
A(T AR lT YE + d) — Ao(1 Am (Dy(2) 
— bo (R" (q  )A*u(t) + S™ (q> )A*y(t)) 


= R (q NAtu() + S* (g Ayit) (3.51) 
Division by A*.A* now gives 
1 1 i efi 
td- gS OSR UOS (8.52) 
where i 4 
_ -qg 
"O= aa A 


1— qg! 
t) = ——— nn yl 
"O= A) 
Notice that the difference operation eliminates levels and that division by 
AzA», corresponds to low-pass filtering. Thus the net effect is that the signals 
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are band-pass filtered with filters that are matched to the desired closed-loop 
dynamics and the specified observer polynomial. 

To complete the algorithm, it now remains to specify how the control law is 
obtained from the estimated parameters. To obtain the response to command 
signals given by Eq. (3.44), it follows from Eq. (3.51) that 


Req ")A*u(t) + S*(g*)A* y(t) + Ao(LAm(L) y(t) = Ala) Am (1)ue (t) 


A controller with integral action may perform poorly if there are actuators that 
saturate. The feedback loop is broken during saturation, and the integrator 
may drift to undesirable values. This phenomenon, which is called windup, 
can be avoided if the control algorithm is modified to 


Ala?) (a(t) — An (Lue (é)) 
= — Ao(1)Am(1)y(t) — S*(a7 A‘) 


— (R (°A — A(T) )uld) 
u(t) =sat (£) 


(3.53) 


The windup phenomenon is discussed in detail in Section 11.2. In summary, 
Algorithm 3.6 is obtained. 


ALGORITHM 3.6 A direct self-tuning algorithm 
Step 1: Estimate the parameters in Eq. (3.52) by recursive least squares. 


Step 2: Compute the control signal from Eqs. (3.53) by using the estimates 
from Step 1. o 


This algorithm may be viewed as a practical version of Algorithm 3.3. 


3.7 CONCLUSIONS 


Deterministic self-tuning regulators have been developed in this chapter. The 
controllers may be viewed as an attempt to automate the steps of modeling 
and control design that are normally done by a control system designer. By 
specifying a model structure, modeling reduces to recursive parameter estima- 
tion. Control design results in a map from process parameters to controller 
parameters. Simple estimation methods (least squares) and simple control de- 
sign techniques (pole placement) have been used in this chapter. The control 
design was based on the certainty equivalence principle, which means that the 
uncertainties in the estimates are neglected in computing the control law. Two 
classes of algorithms have been discussed: indirect and direct algorithms. The 
indirect algorithms are a straightforward implementation in which process pa- 
rameters are estimated and the controller parameters are computed by using 
some design equations. In the direct algorithms the controller parameters are 
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estimated directly. To do this, design equations are used to reparameterize the 
process model in the controller parameters. This makes it possible to establish 
relations between MRAS and STR, as is discussed in Chapter 5. 


PROBLEMS 


3.1 


3.2 


3.3 


In sampling a continuous-time process model with h = 1 the following 
pulse transfer function is obtained: 


z+1.2 
B= z2 — 240.25 
The design specification states that the discrete-time closed-loop poles 
should correspond to the continuous-time characteristic polynomial 


s? +9841 


(a) Design a minimal-order discrete-time indirect self-tuning regulator. 
The controller should have integral action and give a closed-loop 
system having unit gain in stationary. Determine the Diophantine 
equation that solves the design problem. 

(b) Suggest a design that includes direct estimation of the controller 
parameters. Discuss why a well-working direct self-tuning regulator is 
more difficult to design for this process than is an indirect self-tuning 
regulator. 

Consider the process 
1 

s(s +a) 

where a is an unknown parameter. Assume that the desired closed-loop 


system is 


G(s) = 


w? 


32 + 20s + 0? 
Construct continuous- and discrete-time indirect self-tuning algorithms 
for the system. 


Consider the system 


G.,(s) = 


G(s) = Gi(s)Ge(s) 
where 


Gls) = s+a 
C 


Ge(s) = Seed 


where a and b are unknown parameters and c and d are known. Construct 
discrete-time direct and indirect self-tuning algorithms for the partially 
known system. 
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3.4 


3.5 
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A process has the transfer function 


G(s) = Gt 1) 


where b is a time-varying parameter. The system is controlled by a pro- 
portional controller 
u(t) = k (u-(t) — y(2)) 


It is desirable to choose the feedback gain so that the closed-loop system 
has the transfer function 


1 
s +s+1 
Construct a continuous-time indirect self-tuning algorithm for the sys- 
tem. 


The code for simulating Examples 3.4 and 3.5 is listed below. Study the 
code and try to understand the details. 


G(s) = 


DISCRETE SYSTEM reg 


"Indirect Self-Tuning Regulator based on the model 

" ë H(qg)=(b0*gt+bi)/ (q* 2+al*gta2) 

“using standard RLS estimation and pole placement design 
"Polynomial B is canceled if cancel>0.5 


INPUT ysp y "set point and process output 
OUTPUT u “control variable 

STATE ysp1 yi ul vi "controller states 

STATE thi th2 th3 th4 "parameter estimates 

STATE fi f2 f3 f4 "regression variables 

STATE pil pi2 p13 p14 “covariance matrix 

STATE p22 p23 p24 

STATE p33 p34 

STATE p44 


NEW nyspl nyi nui nvi 

NEW nthi nth2 nth3 nth4 

NEW nf1 nf2 nf3 nf4 

NEW nii ni2 ni3 ni4 n22 n23 n24 n33 n34 n44 
TIME t 

TSAMP ts 


INITIAL 

"Compute sampled Am and Ao 
a=exp(-z*wkh) 

am1=—2*a*cos (weh*sqrt (1-z*z)) 


Problems 


am2=a*a 
aop=IF w*To>100 THEN 0 ELSE -exp(-h/To) 
ao=IF cancel>0.5 THEN 0 ELSE -aop 


SORT 

"1.0 Parameter Estimation ; 
"1.1 Computation of P*f and estimator gain k 
pfi=pi11*f1+p12*f2+p13*f3+p14*f4 
pf2=pi2+f1+p22¥*f2+p23*f3+p24*f4 
pf3=p13¥*f1+p23*f2+p33*f3+p344f4 
pf4=p14+f1+p24*f2+p344*f3+p44*f4 
denom=lambdatf1*pf1+fi2*pf2+f£3*pi3+f4*pr4 
ki=pf1/denon 

k2=pf2/denom 

k3=pf3/denom 

k4=pf4/denom 


"1.2 Update estimates and covariances 
eps=y-f1*thi-f2*th2-f3*th3-f4*th4 
nthi=thi+tki*eps 

nth2=th2+k2*eps 

nth3=th3+k3*eps 

nth4=th4tk4*eps 
nii=(p11-pfi*k1)/lambda 
n12=(p12-pf1*k2) /lambda 
n13=(pi3-pf1*k3) /lambda 
ni4=(p1i4-pf1+#k4) /lambda 
n22=(p22-pf 2*k2) /lambda 
n23=(p23-pf2*k3) /lambda 
n24=(p24-pf2*k4) /lambda 
n33=(p33~pf3*k3) /lambda 
n34=(p34-pf3+k4) /lambda 
n44=(p44-pf4*k4) /lambda 


"1.3 Update and filter regression vector 
nf1i=-y 
nf2=f1 
nf3=u 
nf4=f3 


"2.0 Control design 
"2.1 Rename parameters 
al=nthi 

a2=nth2 


131 


132 Chapter 3 Deterministic Self-tuning Regulators 


bO=nth3 
bi=nth4 


"2.2 Solve the polynomial identity AR+BS=AcAm 
=b1*bl—-al*bO*bl1+a2*b0*b0 
ri0=(ao*am2*b0* 2+ (a2-—am2—ao*xam1) #b0*b1+ (aotami—-al) *b1~>2) /n 

wi=(a2*amn1l+a2*ao-al*a2—am2*ao0) *b0 

s00=(wi+ (-al*aml-al*ao-a2+al*2+am2+am1*ao) *b1) /n 

w2=(-al*am2*a0+a2*am2+a2*am1*a0-a2* 2) *bO 

s10=(w2+ (-a2*am1-a2*a0+al¥*a2+am2*ac) *b1) /n 


"2.3 Compute polynomial T=Ao*Am(1)/B(1) 
bs=b0+b1 

as=1+aml+am2 

bm0=as/bs 


"2.4 Choose control algorithm 

r1=IF cancel>0.5 THEN bi/bO ELSE r10 

sO=IF cancel>0O.5 THEN (ami-ai)/b0O ELSE s00 
si=IF cancel>0.5 THEN (am2-a2)/b0 ELSE s10 
tO=IF cancel>0.5 THEN as/b0O ELSE bm0 

ti=IF cancel>0.5 THEN 0O ELSE bm0x*ao 


"3.0 Control law with anti-windup 
v=-ao*rvilt+tO*xyspttl*yspl—s0*y-sl¥*y1+ (ao-ri) ul 
u=IF v<-ulim THEN -ulim ELSE IF v<ulim THEN v ELSE ulim 


"3.1 Update controller state 


ny1=y 

nui=u 

nvi=v 

nyspl=ysp 

"4.0 Update sampling time 

ts=tth 

"Parameters 

lambda: 1 "forgetting factor 

To:200 "observer time constant 
z:0.7 "desired closed loop damping 
wil “desired closed loop natural frequency 
h:l "sampling period 

ulin: 1 "limit of control signal 
cancel:1 "switch for cancellation 


thi:-2 "initial estimates 


3.6 


3.7 


3.8 


3.9 


3.10 


3-11 


3.12 
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th2:1 

th3:0.01 

th4:0.01 

pi1:100 “initial covariances 
p22:100 

p33: 100 

p44: 100 


END 


Consider the simulation of the indirect self-tuning regulator in Exam- 
ple 3.5. Investigate how the transient behavior of the algorithm depends 
on the initial values of 8 and P and the forgetting factor. 


Consider the indirect self-tuning regulator in Example 3.5. Make a sim- 
ulation over longer time periods, and investigate how the parameters ap- 
proach their true values. Also explore how the convergence rate depends 
on the forgetting factor A. 


Consider the indirect self-tuning regulator in Example 3.5. Show that no 
steady-state error is obtained if 


Gy + ĝ = 1 


Modify the simulation used to generate Figs. 3.6 and 3.7, plot the pa- 
rameter combination @ + @:, and check how well the above condition is 
satisfied. 


Consider the indirect self-tuning regulator in Example 3.5. Change the 
specifications on the closed-loop system, and investigate how the behavior 
of the system changes. 


Consider the indirect self-tuning regulator in Example 3.5. Modify the 
simulation program so that the parameters of the process can be changed. 
Investigate experimentally how well the adaptive system can follow rea- 
sonable parameter variations. 


Apply the indirect self-tuning regulator in Example 3.5 to a process with 
the transfer function i 
G(s) = (+1 


Study and explain the behavior of the error when the reference signal is 
a square wave. 


The code for simulating Example 3.6 is listed below. Study the code and 
try to understand all the details. 


CONTINUOUS SYSTEM reg 
"Continuous time STR for the system b/[s(s+a)] 
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"Desired response given by am2/(s"2+ami*stam2) 
"Observer polynomial s+ao 


INPUT y ysp 

OUTPUT u 

STATE yf yf1 uf uf1 xu 
STATE thi th2 

STATE p11 p12 p22 

DER dyf dyf1i duf dufi dxu 
DER athi dth2 

DER dpil dpi2 dp22 


"Filter input and output 
dyf=yfi 
dyfi=-ami*yf1+am2*(y-yf} 
duf=uf1 l 

duf 1=-anl»*uf 1+am2* (u-uf) 


"Update parameter estimate 
fi=-yfi 

f2=uf 
e=dyf1-f1+*thi-f2+*th2 
pfi=pii+fi+pi2*f2 
pf2=pi24f1+p22*f2 
dthi=pfix*e 

dth2=pf2*e 


"Update covariance matrix 
dpii=alpha*p1li-pfi*pf1 
dpi2=alpha*p12-pfl*pf2 
dp22=al pha*p22-pf 2*pf2 
det=p1i*p22-pi2*p1i2 


"Control design 

a=thl 

b=th2 

ri=amitao-a 
s0=(am2+taml*ao—a*ri)/b 
si=am2*ao/b 

tO=am2/b 


"Control signal computation 
dxu=-ao*xu- (s1—-ao*s0) *y+(ao-r1) *u 
v=t0*ysp-sO*y+xu 


3.13 


3.14 


3.15 
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n=if v<-ulim then -ulim else if v>ulim then ulim else v 


"Parameters 
ami:1.4 
am2:1 

alpha :0 
ao:2 

ulim:4 

END 


Consider the simulation of the continuous-time indirect self-tuning reg- 
ulator in Example 3.6. Investigate how the transient behavior of the al- 
gorithm depends on the initial values of 6 and P. 


Consider the indirect self-tuning regulator in Example 3.6. Make a simu- 
lation, and investigate how the convergence rate depends on the forgetting 
factor a. 


Consider the system in Problem 1.9. 

(a) Sample the system, and determine a discrete-time controller for the 
known nominal system such that the specifications are satisfied. 

(b) Use a direct self-tuning controller, and study the transient for differ- 
ent initial conditions and different values of the variable parameters 
of the system. 

(c) Assume that e = 0 and that w. is a square wave. Simulate a self- 
tuning controller for different prediction horizons. 

(d) Investigate the behavior when the disturbance d is a step. What 
happens when the controller does not have an integrator? 
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CHAPTER 4 


STOCHASTIC AND PREDICTIVE 
SELF-TUNING REGULATORS 


4.1 INTRODUCTION 


In Chapter 3 the key issue was to find self-tuning controllers that give de- 
sired responses to command signals. In this chapter we discuss self-tuners 
for the regulation problem. The key issue is now to design a controller that 
reduces disturbances as well as possible. Stochastic models are useful to de- 
scribe disturbances. For this reason we start in Section 4.2 by describing a 
simple stochastic control problem. This leads to a minimum-variance controller 
and its generalization, the moving-average controller. In Section 4.3 we present 
a direct adaptive controller that has the surprising property that the moving- 
average controller is an equilibrium solution. This surprising property was one 
of the motivating factors in the original work on the self-tuning regulator. The 
minimum-variance controller has the drawback that its properties are criti- 
cally dependent on the sampling period. In Section 4.4 some extensions are 
therefore presented. Linear quadratic Gaussian self-tuners are discussed in 
Section 4.5, and adaptive predictive control is discussed in Section 4.6. 


4.2 DESIGN OF MINIMUM-VARIANCE 
AND MOVING-AVERAGE CONTROLLERS 


In this section we derive controllers for linear stochastic systems. It is assumed 
that the process can be described by a pulse transfer function and that the 
disturbances acting on the system are filtered white noise. A steady-state 
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regulation problem is considered. The criterion is based on the mean square 
deviations of the output and the control signal. 


Process Model 
Assume that the process dynamics are characterized by 


x(t) = ri u(t) 


where A;(g) and 2,(g) are polynomials in the forward shift operator without 
any common factors. 

It is assumed that the action of the disturbances on the system can be 
described as filtered white noise. Since the system is linear, we can reduce all 
disturbances to an equivalent disturbance v at the system output. The output 
is thus given by 


y(t) = x(t) + v(t) 


where ala) 
q 
u(t) = ——— e(t 
(0) = 5 la) (£) 
Ci(q) and Ao(g) are polynomials in the forward shift operator without any 
common factors, and {e(t}} is a sequence of independent random variables 
(white noise) with zero mean and standard deviation o. 
The process can now be reduced to the standard form 


A(q)y(t) = B(q)ju() + C(g)e() (4.1) 
where 
A= Alg 
B= BAe (4.2) 
C = CA 


Because of the assumptions, the three polynomials have no common factor. The 
model (4.1) is thus a minimal representation. The polynomials are normalized 
such that both the A and C polynomials are monic, that is, the leading coef- 
ficients are unity. Finally, the C polynomial can be multiplied by an arbitrary 
power of g without changing the correlation structure of C(qg)e(t). This is used 
to normalize C such that 

deg C = depA=n 


The A and B polynomials may have zeros inside or outside the unit disc. It 
is assumed that the zeros of the C polynomial are inside the unit disc. By 
spectral factorization the polynomial C(g) can be changed so that all its zeros 
are inside the unit disc or on the unit circle. An example shows how this is 
done. 


4.2 Design of Minimum-Variance and Moving-Average Controllers 139 


EXAMPLE 4.1 Modification of the polynomial C 


Consider the polynomial 
C(z)=z+2 


which has the zero z = —2 outside the unit disc. Consider the signal 
v(t) = C(a)e(t) 


where {e(t)} is a sequence of uncorrelated random variables with zero mean 
and unit variance. The spectral density of v is given by 


(ei) = = C(e*)C(e**) 
Because 
C(z)C(z*) = (z + 2)(27) +2) = (1 + 227%) (1 + 22) 
= (2z + 1)(227' +1) 
= 4(z + 0.5}(27! + 0.5) 


the signal v may also be represented as 


v(t) = C*(g)e(t) 
where 
C*(z)=2z+1 
is the reciprocal of the polynomial C(z) (see Section 3.2). o 
If the calculations (4.2) give a polynomial C that has zeros outside the unit 
disc, the polynomial is factored as 
CCIC 


where C` contains all factors with zeros outside the unit disc. The C polynomial 
is then replaced by C+C-*. The model (4.1) is an innovations representation. 
It will be shown later that e(¢) is the innovation or the error in predicting the 
signal y(ż) over one sampling period. The C polynomial can be interpreted as 
the characteristic polynomial of the estimator or predictor. 


Criteria 
In steady-state regulation it makes sense to express the criteria in terms of 


the steady-state variances of the output and the control signals. This leads to 
the performance criterion 


J = BE{E) + ou"(0)} (4.3) 


where E denotes mathematical expectation with respect to the noise process 
acting on the system. The control law minimizing (4.3) is the linear quadratic 
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Gaussian (LQG) controller. If ¢ = 0, then the resulting controller is called the 
minimum-variance (MY) controller. 

The properties of the control signal when the minimum-variance controller 
is used depend critically on the sampling interval. A short sampling interval 
gives large variance in the control signal, and a long sampling interval gives 
a low variance. Notice that the loss function (4.3) is defined in discrete time, 
that is, only the behavior at the sampling instances is considered. 

To define the design problem, it is also necessary to define the admissible 
controllers. It will be assumed that u(t) is allowed to be a function of y(t), 
y(t—1), ..., u(t — 1), w(t —2),.... 


Minimum-Variance Control 


It is now assumed that ¢ = 0 and that the process is minimum-phase, that is, 
that the B polynomial has al zeros inside the unit disc. Before we solve the 
general problem, we consider a simple example. 


EXAMPLE 4.2 Minimum-variance control of a first-order system 
Consider the first-order system 


y(t + 1) + ay(t) = bu(t) + eft + 1) + ce(t) (4.4) 


where |c| < 1 and {e(t)} is a sequence of independent random variables with 
unit variance. 

Consider the output at time £+ 1. From (4.4) it follows that by using u(t) 
it is possible to change y(t + 1) arbitrarily. Further, e(t + 1) is independent of 
y(t) and u(t); thus 

var y(t + 1) > var e(t +1) =1 


Given measurements up to time t, we can use Eq. (4.4) to compute e(t). The 
controller , : 
u(t) = a) = cet (4.5) 
gives 
y(t + 1) = e(t + 1) (4.6) 
which gives the lower bound of the variance of y. If Eq. (4.5) is used all the 
time, then from Eq. (4.6), it follows that y(t) = e(t), and we get the controller 


u(t) = =~ y) (4.7) 


o 


The minimum-variance controller can in the general case be derived by 
using similar ideas as Example 4.2. Define 


dy = deg A— deg B 
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as the pole excess of the system. This is the same as the time delay in the 
system. The input at time £ will influence the output first at time t+ dp. Now 
consider 


y(t-+ do) =È u(t + de) +É eft + do) (4.8) 


Let the polynomial F of degree dy — 1 be the quotient, and let the polynomial 
G of degree n ~ 1 be the remainder when g“~!C is divided by A. Hence 


ge Cla) Ga) 
~ AQ) A(”) 
This can be interpreted as a Diophantine equation, 
gq” *C(q) = Ala) F(a) + Ga) (4.9) 
Hence the output at £+ do can be written as 


= F(a) + Fr 


yt + do) = Put +do) + Fe(t+1)+ g? eft) 


where 
F(q) =g + fide? +... + fai (4.10) 
G(q) = gq” * + ong? + +--+ gni (4.11) 
From Eq. (4.1) we can determine eft): 


e(t) = SI- Sule) 


From the measurement of y(t} and u(t) it is thus possible to compute the noise 
sequence, the innovations. This equation is an observer in which the dynamics 
are piven by the C polynomial. It now follows that 


y(t + do) = Felt + 1) + (Za _ 102) u(t) + EA aca 


g V6) 
= Fe(t + 1) + ŽS (%0 — G) u(t) + z y(t) 


= Fe(t+1)+ vr 


u(t) + Z y(t) (4.12) 


where Eq. (4.9) has been used to obtain the last «guar The polynomials qG, 
qBF, and C are all of degree n. This implies that we have divided y(t + do) 
in two parts. The first part, F(g)e(é+ 1), depends on the noise acting on the 
system from £+ 1, ..., £ + do. The second part, 


HE + dolt) = a u(t) + i Z yle) (4.18) 


depends on measured outputs and sontied puts including the u(t) that we 
want to determine. From Eqs. (4.12) it follows that (t+do|é) is the mean square 
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prediction of y(t + do) given data up to and including time t. The prediction 
error is given by 


Ht + dolt) = y(t + do) — H(t + dolt) = F(q)e(t + 1) 


and the variance of the prediction error is 


var f(t + dolt) = o7(1 + f + f+... + Fy) 


Minimum variance of the output is now obtained by the control law 


u(t) =- Fane y(t) (4.14) 


Using this controller gives 
y(t + do) = F(qg)e(t + 1) 
= e(t + do) + fie(é+ do— 1) +...+ fa,re(t+1) (4.15) 


and the minimum output variance is 


var y(t) =o7(14 fi + Ai+..-+ fa) 


which is the same as the variance of the prediction error. Using the controller 
(4.14) gives the closed-loop characteristic equation 


g”C(q)B(q) = 0 


This implies that there are d)—1 poles at the origin, n poles at the zeros of the 
C polynomial, which are inside the unit disc, and n — dy poles at the zeros of 
the B polynomial. Since the system was assumed to be minimum-phase, these 
poles are also inside the unit disc. Observe that minimum-variance control is 
the same as predicting the output dp steps ahead and then choosing the control 
signal such that the predicted value is equal to the desired reference value. See 
Fig. 4.1. 

The minimum-variance controller can be interpreted as a pole placement 
controller, which was discussed in Section 3.2. This is seen by multiplying 
Eq. (4.9) by B, that is, 


g’ CB = AR +BS (4.16) 
where 
R = BF 
S=G 


The pole placement design leads to the controller 


u() = -Z 90) = vd 
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Figure 4.1 Minimum variance control is based on prediction do steps ahead. 


Nonminimum-Phase Systems 


When the system is nonminimum phase, it is not possible to place some of 
the closed-loop poles at the zeros of the B polynomial. It can be shown that 
the optimal controller minimizing Eq. (4.3) with @ = 0 gives the following 
closed-loop characteristic equation: 


q° Bt (q)B™(q)C(q) = 0 
that is, the process zeros outside the unit disc, B- (q), are replaced by the zeros 
defined by the reciprocal polynomial, B—* (q). See Astriém and Wittenmark 
(1990) in the references at the end of the chapter. The controller 

S 
u(t) =F x(t) 

is now obtained from the Diophantine equation 

g B+B™C = AR + BS (4.17) 
Compare Eq. (4.16). 


Moving-Average Controller 


The minimum-variance controller leads to a closed-loop system in which the 
output is a moving average of order dp — 1 (see Eq. (4.15)). It is possible to 
design controllers such that the output is a moving average of higher order. 
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Instead of placing dy — 1 closed-loop poles at the origin, we may place d — 1 
poles, where d > dp. 
The moving-average controller can be derived as follows. Factor the B 
polynomial as 
B(q) = B*(q)B (q) 
where B+ corresponds to well-damped zeros. To obtain a unique factorization, 
it is assumed that B+ is monic. Determine R and S from 
g?1BtC =AR+BS (4.18) 
It follows that Bt must be a factor of R, that is, R = R, Bt. With the feedback 
law & 
fj) = —— y(t 
u(t) = -5 (2) 
we get 


Ast) =B -F ) v) + Cela) 


or 
CR CBtR, 


y(t) = AR + BS e(t) = y BFC e(t) 
= Sh e(t) = (1+rig™ +... + raag") eft) 
where deg R, = d—1 with 
d = deg A — deg Bt 


Since the controlled output is a moving-average process of order d — 1, we call 
the strategy moving-average (MA) control. Notice that no zeros are canceled if 
B*=1 

which means that 
d= degA =n 


The minimum-variance controller and the moving-average controller are simi- 
lar. The only difference is the value of the integer d, which controls the number 
of process zeros that are canceled. With d = dp, all process zeros are canceled; 
with d = deg A = n, no process zeros are canceled. 


EXAMPLE 4.3 Moving-average controller 
Consider the system (4.1) with 


Alga) = @ +a +a 
B(q) = bog + bı 
Cla) =+ ag+cez 
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In this case, do = 1. The minimum-variance controller is obtained from 
Eq. (4.9), giving the controller 


(c1 — a1) + (c2 — a2)q™? 
u(t) = a ae e y(t) 
and the closed-loop system is 
y(t) = e(t) 

The minimum-variance controller can be used only if |b1/6o| < 1, that is, for 
the minimum-phase case. 

The moving-average controller is obtained by solving Eq. (4.18). In this 
case, d = 2 and B*(q) = 1. This gives the Diophantine equation 


g(a” + e1g + c2) = (g? +019 + a2)(q +11) + (bog + b1)(sog + 1) 
Notice that this is the same as Eq. (3.19) with A,(q) = q and A,,(q) = C(q). 
The solution is thus given by Eqs. (3.20) and (3.21): 
a (as — c2)bobı + (c — @)b? 
b? + aıbobi + azb? 
s= by (a? — az — ciar + c2) + bo(e1a2 — 2102) 
b? + arbobi + azb? 


s= bi (aiaz — c1a2) + boflagca — a3) 
t b? + arbob: + azb? 


ry 


The closed-loop system is 
ot) = (1+ nigel) o 


LOG Control 


The pole placement and LQG problems are closely related. In the LQG formu- 
lation a loss function is specified. Minimization of the loss function leads to a 
fixed-gain controller that can be interpreted in terms of pole placement. The 
details are given in Section 4.5. To obtain the LQG solution, it is first necessary 
to solve the spectral factorization problem, that is, to find the nth-order monic, 
stable polynomial P(q) that satisfies 


rP(g)P(q*) = AlAT) + BQ) Bg") (4.19) 
The LQG-controller is then obtained as the solution to the Diophantine equa- 
tion 
C(q)P(g) = A(g)R(q) + B(g)S(q) (4.20) 
To get a unique solution with deg R = deg S = n, it is necessary to make some 
further restrictions to the solution given by Eq. (4.20). See Theorem 4.3 in 
Section 4.5. The interpretation of Eq. (4.20) is that the LQG-controller places 
the closed-loop poles in P(g), given by the spectral factorization, and in C(q), 
which characterizes the disturbances. 
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Summary 


The minimum-variance controller, the moving-average controller, and the LQG- 
controller can all be interpreted as pole placement design as discussed in 
Section 3.2. The mimnimum-variance controller is obtained by solving the Dio- 
phantine equation (4.16) for the minimum-phase case or Eq. (4.17) for the 
nonminimum-phase case. The moving-average controller is given by Eq. (4.18) 
and the LQG-controller by Eq. (4.20). The closed-loop characteristic polyno- 
mial is chosen differently for each of the design methods. 


4.3 STOCHASTIC SELF-TUNING REGULATORS 


Indirect Self-tuning Regulator 


A straightforward way to make a self-tuning regulator for the process (4.1) is 
to estimate the parameters in the A, B, and C polynomials by using, for in- 
stance, the extended least squares (ELS) algorithm or the recursive maximum- 
likelihood (RML) algorithm. (See Section 2.2.) The estimated parameters are 
then used in the design equation (4.9) if minimum-variance control is desired 
or in Eq. (4.20) if LQG control is desired. 


EXAMPLE 44 Stochastic indirect self-tuning regulator 


Consider the process (4.4) in Example 4.2 with a = —0.9,b = 3, and c = —0.3. 
The minimum-variance controller is given by the proportional controller 


a—c 
u(t) = E y(t} = —soy(t) = —0.2y(¢) 
This gives the closed-loop system 


y(t) = e(t) 
The ELS method is used to estimate the unknown parameters a,b, and e. The 
estimates are obtained from Eq. (2.21) with 

eT ={a b c) 
gp (t—1)=(-x-1) u(t—1) e(t—1)) 
e(t) = y(t) — g7 (t — 1)6(t — 1) 
The controller is 
a(t) — é() 


u(t) = bay S 
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Figure 4.2 Output and input when an indirect self-tuning regulator based 
on minimum-variance control is used to control the system in Example 4.4. 


Figure 4.2 shows the result of a simulation of the algorithm. The initial values 
in the simulation are 


&(0) =0 
§(0) =1 
é(0) = 0 
P(0) = 1007 


Figure 4.3 shows the accumulated loss 


v=) 70 


i=l 


when the optimal minimum-variance controller and the indirect self-tuning 
regulator are used. The curve of the accumulated loss of the STR is al- 
most parallel to the optimal curve. This means that the performance of the 
self-tuning regulator is almost optimal except for a short startup transient. 
Figure 4.4 shows the estimated process parameters. The parameter estimates 
have not converged to the true values during the simulated period. However, 
the controller parameter ŝo(t) = (â(t) — é(t)) /b(t) converges faster, as can be 
seen in Fig. 4.5. For a fixed controller the closed-loop system is stable when 
—0.03 < so < 0.63. Notice that during some of the first steps the controller 
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Minimum variance control 
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Figure 4.3 The accumulated loss when a self-tuning regulator and the 
optimal minimum-variance controller are used on the system in Example 4.4. 


parameter §o{t) is such that the closed-loop system would be unstable if the 
controller were frozen to those values. 

The reason for the poor convergence of the three estimated process param- 
eters is that the controller converges rapidly to a minimum-variance controller. 
After that there is poor excitation of the process. The example shows that the 
self-tuning controller compares well with the optimal controller for the known 
system. From the control law it can be seen that there may be numerical prob- 
lems when 6(t) is small. O 


Direct Minimum-Variance and Moving-Average STR 


The design calculations for the indirect self-tuning regulators include the so- 
lution of a system of equations such as the Diophantine equation (4.18) or 
(4.20). The time to solve the Diophantine equation may be long in comparison 
with the sampling period. A self-tuning regulator that directly estimates the 
controller parameters eliminates the design calculations. It is thus desirable to 
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Figure 4.5 The controller parameter §)(#) when the system in Example 4.4 
is controlied. The dashed line is the optimal parameter for the minimum- 
variance controller. 


construct direct self-tuning algorithms. Deterministic direct self-tuners were 
discussed in Section 3.5. The idea is to use the specification and the process 
model to make a reparameterization of the system. The same idea will now 
be used for stochastic systems of the form (4.1). In Section 4.2 it was shown 
that minimum-variance control is the same as predicting the output do steps 
ahead and then determining the control signal u(t) such that the predicted 
value is equal to the desired output. Consider the reparameterization (4.12), 
and rewrite the model in the backward shift operator. This gives 


y(t + do) = 5 (R* u(t) + S*y(t)} + Rye(t + do) (4.21) 


where R] = F* and deg Ri = dọ — L. Using Eq. (4.18), we get, in the same way, 


y(t + d} = -m (R*u(t) + S*y(t))} + Rie(t + d) (4.22) 


where deg Ry = d—1. 

The factors 1/C* and B-*/C* in Eqs. (4.21) and (4.22), respectively, can be 
interpreted as filters for the regressors. (Compare Section 3.5.) Both equations 
are now written in predictor form, where the controller polynomials R and S 
appear directly in the model. These equations can be used as a motivation for 
the following algorithm. 


ALGORITHM 41 Basic direct self-tuning algorithm 


Data: Given the prediction horizon d, let $ and Z be the degrees of the R* and 
S* polynomials, respectively. Let Q*/P* be a stable filter. 


Step 1: Estimate the coefficients of the polynomials R* and S* of the model 


y(t + d) = R*(q™ ug (t) + S*(9)y¢(t) + elt +d) (4.23) 
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where 
R’(q")=rotng i + trg” 
S*(q7) = sot sig? + +s 
and + 1 
ur) = Fay wl 


v(t) = FI v0 


using Eq. (2.21) with 


E(t) = y(t) — R'up(t— d) — S*yr(t — d) = y(t) — p° (t — d)O(t — 1) 


pra = 2) (wy. ut- yO o E-D) 


P*(q") 
OF =(ro ... TR 8 -c sı) 
Step 2: Calculate the control signal from 
R*(g7 u(t) = —S*(9)9(#) (4.24) 
with Æ and S* given by the estimates obtained in Step 1. 
Repeat Steps 1 and 2 at each sampling period. Oo 


Remark I. Notice that this algorithm is the same as Algorithm 3.3 when 
u, = 0, but with different filters. 


Remark 2. The parameter ro can either be estimated or be assumed to be 
known. In the latter case it is convenient to write R" as 


Rg) =ro(lergtt+: +40") 


and use 
E(t) = y(t) — row p(t — d) — p" (t — d)6(t— 1) 
Tray _ Q* (g?) L _ _ 
gp’ (t) = Pi (rou(t—1) ... roult—k) yt) ... x(t—2)) 
OT =(r, -Th So ... S) 
Asymptotic Properties 


The models of Eqs. (4.21) and (4.22) can be interpreted as reparameterizations 
of the process model of Eq. (4.1) in terms of the controller parameters. They 
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are identical to the model of Eq. (4.23) in Algorithm 4.1 if the filter Q*/P* 
is chosen to be 1/C” and B*—/C", respectively. The regression vector is then 
uncorrelated with the errors, and the least-squares estimate can be expected to 
converge to the true parameters. The C* and 8~* polynomials are not known, 
however, The surprising result is that the algorithm also self-tunes to the 
correct controller even when the filter is not correct. This property inspired 
the authors of this book to introduce the term “self-tuning.” The following 
result shows that the correct controller parameters are equilibrium values for 
Algorithm 4.1 for an incorrect choice of Q*/P* also. A more detailed analysis 
of stability and convergence is found in Chapter 6. 


THEOREM 4.1 Asymptotic properties 1 


Let Algorithm 4.1 with Q*/F* = 1 be used with a least-squares estimator. The 
parameter ro = bo can be either fixed or estimated. Assume that the regression 
vectors are bounded, and assume that the parameter estimates converge. The 
closed-loop system obtained in the limit is then characterized by 


yt+t)yt)=0 t=d,d+l,...,d+l 
ee (4.25) 
yétru(t)=0 t=d,d+l1,...,.d+k 


where the overbar indicates a time average. Also, k and 7 are the degrees of 
the polynomials A* and S*, respectively. 
Proof: The model of Eq. (4.23) can be written as 
y(t +d) = o7 (8 + elt +d) 
and the control law becomes 
gy? (t)@(t +d) =0 (4.26) 


At an equilibrium the estimated parameters 6 are constant. Furthermore, they 
satisfy the normal equations (2.5), which in this case are written as 


TY gtk +4) = = S oeo HOE + a) 
k=} k=l 


By using the control law it follows from Eq. (4.26) that 


lim = S gp(k)y(k +d) = lim = 5 p(k)o7 (k) (Olt + d) — 84 + d) 
k=1 k=1 


tooo ft 


If the estimate 6(t) converges as £ — oo and the regression vector (k) is 
bounded, the right-hand side goes to zero. Equation (4.25) now follows from 
@*/P* = 1 and the definition of the regression vector in Algorithm 4.1. oO 


Stronger statements can be made if more is assumed about the system to 
be controlled. 
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THEOREM 42 Asymptotic properties 2 


Assume that Algorithm 4.1 with least-squares estimation is applied to Eq. (4.1) 
and that 
min(k,l) > n—1 (4.27) 


If the asymptotic estimates of R* and S* are relatively prime, the equilibrium 
solution is such that l 


yt +rt)y(t)=0 t=d,d+1,... (4.28) 


that is, the output is a moving-average process of order d — 1. 
Proof: The closed-loop system is described by 
R*u(t) = —S*y() 
A*y(t) = B*u(t — do) + C*e(t) 
Hence 
(A*R* + g% B*S*)y = R'C*e 
(A*R* + gq” B*S*)u = => Ce 
Introduce the signal w defined by 
(A*R* + gT% B’S*)w = C'e (4.29) 


Hence 
y=R*w and u=—-S*w (4.30) 


The condition of Eq. (4.25) then implies that 
Rw(t)y(t +7) = 0 t=d,d+1,...,d+1 
Stw(t)y(t+t)=0 t=d,d+l1,...,d+k 


If we introduce 
Cuy(t) = wlé)y(t +T) 


the preceding equations can be written as 


To Ti Fo «es Fe Q a O 
0 ro Fi Ta ... TR 
Cuy(d) 
O ... O ro ry Ta oes Tk ; 
$o Si Sg a. SI 0O ... @ : =0 
0 So S1 S&S ... S Cuy(d + k +1) 
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Since the Sylvester matrix on the left is nonsingular when R* and S* are 
relatively prime (compare Section 11.4), it follows that 


Cuy(t) = 0 ts=d,d+1,....d+k+i 


The covariance function satisfies the equation 
F*(q")Gy(t)=0 T20 
The system of Eq. (4.29) has the order 
n + k = n + max(k,!L) 


If 
k+l+1>n+max(k,l) 


or, equivalently, 
min(k,l} >n—1 


it follows that 
Ciy(t) = 0 ¢=d,d+1,... 
It also follows from Eq. (4.30) that 


C(t)=0 t=d,d+1,... 
which completes the proof. E 


Remark 1. The algorithm thus drives the correlation of the output to zero 
starting at lag 7 = d. It follows from Theorem 4.1 that the correlations at lags 
d, d + 1,..., d +i will always be zero at equilibrium. If there are enough 
parameters in the controller, the covariance of the output will be zero for 
all higher lags. Notice that the condition of Eq. (4.28) is easily checked by 
monitoring the covariances of the output. 


Remark 2. It is possible to influence cancellation of the process zeros simply 
by choosing the integer d. With d = dp a controller that cancels all zeros is 
obtained. With d = n the controller will not cancel any process zeros. oO 


Theorems 4.1 and 4.2 imply that if the estimates converge, and if there 
are sufficiently many parameters in the controller, then Algorithm 4.1 will 
converge to the moving-average controller. 


Examples 


The properties of the minimum-variance and moving-average self-tuners are 
illustrated with two examples. 
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Figure 4.6 The parameter p/s in the controller, when the process in 
Example 4.5 is controlled by using the direct minimum-variance self-tuning 
controller. 


EXAMPLE 45 Direct minimum-variance self-tuning regulator 
Consider the same process as in Example 4.4. The process model of Eq. (4.23) 
is now 
y(t + 1) = rou(é) + soy(t) + E(t + 1) 


It is assumed that ro is fixed to the value fo = 1. Notice that this is different 
from the true value, which is 3. The parameter sọ is estimated by using the 
least-squares method. The control law becomes 


u(t) = -2 v(t) 


Figure 4.6 shows §)/7o, which is seen to converge rapidly to a value corre- 
sponding to the value of the optimal minimum-variance controller, even if Fp is 
not equal to its true value. This is also seen in Fig. 4.7, which shows the loss 
function when the self-tuner and the optimal minimum-variance controller are 
used. Compare Figs. 4.3 and 4.5. D 


Minimum variance control 
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Time 


Figure 4.7 The loss function when the direct self-tuning regulator and the 
optimal minimum-variance controller are used on the system in Example 4.5. 
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EXAMPLE 4.6 MA control of a nonminimum-phase system 


Consider an integrator with a time delay r. For the sampling period h >< the 
system is described by 


A(q) = g(¢-1) 
B(q) = (h-r) +r = (h—-1)(g +b) 


where 
T 


A—t 
The noise is assumed to be characterized by 
Cig)=a(gte) = |el <1 


The sampled-data system is nonminimum-phase if t > h/2. This implies that 
the basic minimum-variance self-tuner can be used only if 7 < h/2. Let the 


b= and dg =l1 


a 
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Figure 4.8 Simulation of the self-tuning algorithm on the integrator with 
time delay in Example 4.6. At £ = 100 the delay is changed from 0.4 to 0.6. 
(a) d = 1; (b) d = 2. 
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controller have the structure 


u(t) = —8o(t)y(t) — Fi(éju(t —1) 


Simulations of the system are shown in Fig. 4.8 for A = 1 and e = —0.8. 
The time delay is initially 0.4 and is increased to 0.6 at time ¢ = 100, at 
which time the sampled-data system gets a zero outside the unit circle. Figure 
4.8(a) shows the results obtained with d = 1, the minimum-variance structure. 
The parameters first converge toward the minimum-variance controller. At 
t = 100 the sampled-data system gets a zero outside the unit circle. The 
self-tuning regulator then tries to cancel the zero, and the closed-loop system 
becomes unstable after some time. It does not become unstable exactly at 
t = 100 because it takes a while for the controller parameters to change. 
The control signal is limited to +20, which explains why the signals do not 
grow exponentially. The forgetting factor is A = 0.99. Figure 4.8(b) shows the 
results for the algorithm with d = 2. The moving-average controller is a stable 
equilibrium for both r = 0.4 and t = 0.6. There will be a shift in the parameter 
values when the delay is changed, but the closed-loop system is stable. 

The controller that gives the smallest attainable variance of the output 
gives the standard deviations 1.000 and 1.004 when zr = 0.4 and 0.6, respec- 
tively, while the moving-average controller gives the standard deviations 1.003 
and 1.007 when r = 0.4 and 0.6, respectively. Degradation in the performance 
when the moving-average controller is used in this example is thus minor. O 


44 UNIFICATION OF DIRECT SELF-TUNING REGULATORS 


The moving-average self-tuner is attractive because of its simplicity. It is easy 
to explain intuitively how the algorithm works, and the algorithm is easy to 
implement. This has led to great interest in the algorithm. The algorithm can 
be explained as follows: Determine the structure of a predictor that can be 
used to predict the output d steps ahead. The parameters of the predictor are 
estimated in real time. On the basis of the estimated parameters the control 
signal is determined such that the predicted output of the process is equal to 
the reference value. The algorithm has been analyzed extensively. The closed- 
loop bandwidth depends critically on the sampling period A and the prediction 
horizon d, so both must be chosen with care. The algorithm may result in 
a controller in which process zeros are canceled; the cancellations depend on 
the choice of prediction horizon. Many variants of the algorithm have been 
suggested. A number of these can be described in a unified framework, as we 
will demonstrate. 
Consider the model of Eq. (4.1), and introduce the filtered output 


Q*(q~*) 
P*(q") 


y(t) = y(t) 
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where Q* and P* are stable polynomials. The filtered output satisfies the 
equation 

A" (q>) P (a )y¢(t) = Ba") Q*(g ult — do) + C* (q™°)Q* (a> )e(2) 
Introduce the identity 
C* (qR (a?) HAE NPG Rg") + AS’) 
Then 


1 


yr(t + do) = C*Q* 


Introducing 


(S*y-(t) + B*Q*Riu(t)} + Ry e(t + do) 


IO = Bay HO = pO 


gives the model 


yet + do) = a (S*/(t) + B'Riu(t)) + Ryelt+do) (431) 


By analogy with Eq. (4.21) this model structure could be used with Algo- 
rithm 4.1 to derive a self-tuning regulator for minimization of the variance 
of yp. This reparameterized model now suggests the following generalized self- 
tuning algorithm. 


ALGORITHM 4.2 Generalized direct self-tuning algorithm 


Data: Given the prediction horizon, d, the order of the controller, deg R* and 
deg S*, the stable observer polynomial, A*, and the stable polynomials Q* and 
P*, define the filtered signals 


yt) = Soft) lt) = Fol 


Step 1: Estimate the coefficients of the polynomials R* and S* of the model 


yrlt +d) = T u(t) + z Yp) +e +a) (4.32) 


using the least-squares method. 
Step 2: Calculate the control signal from 


S* 
| u(t) = -5 y) 
with R* and S* given by the estimates obtained in Step 1. 
Repeat Steps 1 and 2 at each sampling period. O 
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From Eq. (4.31} and Theorems 4.1 and 4.2 it follows that if the estimates 
converge, then the closed-loop system will be 


y¢(t} = Rie(t) 


or 
P*R 

y(t) = g * e(t) (4.33) 

where Fj is given by the identity 
C*Q* = APR + g@BOS* (4.34) 

and the control signal is given by 

Ss" S* 

u(t) =- r) = ~ pap) (4.35) 


where R* = Bt*Rj. The closed-loop poles will thus be influenced by Q*, and 
additional zeros can be introduced through P*. The introduction of the filter 
Q* /P* gives what is sometimes called a detuned minimum-variance algorithm. 

Algorithm 4.2 is essentially the same as Algorithm 4.1 applied to filtered 
signals. The filter Q*/P* and the prediction horizon will determine the pulse 
transfer operator of the closed-loop system. The optimal observer polynomial 
is C*, which is unknown. Instead, an approximation Aj is used. The observer 
polynomial A* will determine the convergence properties. This will not influ- 
ence the asymptotic properties as long as the filter Q*/P* and its inverse are 
stable. 

Minimum-variance control may result in large control signals. One way to 
decrease the variation of the control signal is to generalize the loss function 
such that it also contains a penalty on the control signal. Linear quadratic 
controllers are of this type; a minor drawback with linear quadratic self-tuning 
regulators is the computational burden. One way to simplify the problems is 
to use a loss function of the form 


E { (Pa +do))" + (Que) \%} 
where 
Yı = {y(t), yE — 1), ..., (0), u(t), u(t — 1), ..., u(0)} 


that is, the data available at time t. The resulting controller is sometimes called 
a generalized minimum-variance controller. This controller can be interpreted 
in the same framework as above. To illustrate this, assume that P* = 1 and 
that Q* = yP. This gives the loss function 


E {y(t + do) + POLN} (4.36) 


Notice that the loss function depends only on the output y at time £ + do, that 
is, at only one time instant. Loss functions of the form (4.36) are sometimes 
called one-stage loss functions. 
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Figure 4.9 Equivalent systems. 


Assume that the process is governed by Eq. (4.1). By using the represen- 
tation of the process dynamics given by Eq. (4.21) it can be shown that the 
control law that minimizes Eq. (4.36) is 


(m+ 2c) uy = -s70 as 


where 
R* = R] B* 


and R* and S* are given by Eq. (4.16). 

By using the same idea it is possible to construct a new system, which 
has Eq. (4.37) as its minimum-variance controller. Augment the original sys- 
tem with a parallel connection with the pulse transfer operator pq~/ro (see 
Fig. 4.9). This is in fact a standard technique to obtain an equivalent controller 
with a bounded gain. The input-output relation of the augmented system is 


Ayali) = G + La) u(t — do) + C"e(t) 
The minimum-variance control law for this system is given by 
Ri G + 2a) u(t) = —S* ya (t) (4.38) 
where Ri and S* satisfy Eq. (4.16). It follows from Fig. 4.9 that 


ya(t) = y(t) + © quli) 
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Then Eq. (4.38) can be written as 


(RE +£ A°F;) u(t) = —S* (ow + P gui) 


ro 
or 
(ziz + £ (A*Ri + r#s*)) u(t) = —S*y(t) 


Equation (4.16) gives 


(riz + 2 c) u(t) = —S*y(t) 


which is identical to Eq. (4.37). Notice that with the control law of Eq. (4.38) 
the canceled factor is not B* but B* + pA*/ro. This implies that problems can 
be expected when the system is nonminimum-phase and close to the stability 
boundary. 

In the generalized minimum-variance control algorithm it is assumed that 
C*(q-!) = 1. The algorithm can thus be obtained simply by adding a parallel 
path to the original system and applying an ordinary self-tuning regulator 
based on minimum-variance control to the augmented system. The control 
gain is adjusted simply by changing the parameter p of the parallel path. 

The preceding analysis shows that Algorithm 4.2 is very flexible. It can 
be used for many different types of specifications, not only for minimum- 
variance control. This is very important for the implementation of self-tuning 
regulators. 


Self-tuning Feedforward Control 


Feedforward control is a very useful way to reduce the influence of known 
disturbances. Examples of measurable disturbances can be temperatures and 
concentrations in incoming product streams in chemical processes, outdoor 
temperature in climate control systems, and thickness of the paper in paper 
machines. Command signals can also be interpreted as a measurable distur- 
bance. The controller in Eq. (3.2) can be interpreted as feedforward from the 
command signal. To use feedforward, it is necessary to know the dynamics of 
the process. It is, however, also possible to establish self-tuning feedforward 
compensation. One way to do this is to postulate a model structure of the form 


y(t +d) = R*u(t) + S*y(t) + T’v(t) + E(t + d) 


where v(t) is the measurable disturbance acting on the system. The signal v 
could also be the reference value. The polynomials R*, S*, and T* are estimated 
in the usual way, and the control law is chosen to be 


ud = -Žr - Eo) 


Self-tuning feedforward control has been used successfully in many industrial 
applications. 
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Examples 
The behavior of Algorithm 4.2 is illustrated through two examples. 


EXAMPLE 4.7 Effect of filtering 
Consider the process 


y(t) + ay(t — 1) = bu(t — 1) + e(t) + ce(t — 1) 


where a = —0.9, b = 3, and c = —0.3, which is the same process as in 
Examples 4.4 and 4.5. Let the filter be 

Q l+aig™ 

P olt+pqt 


The identity of Eq. (4.34) gives the solution 


So =c+q—a— pi 
81 = ¢¢1 — Qapi 
The control law is given by Eq. (4.35), with 
Ri P'B™ = b(1 + pig’) 


0 0.02 0.04 V, 


Figure 4.10 The output variance, V,, and input variance, Vp, as functions of 
qı of the system in Example 4.7 when pı = —0.3, 0, and 0.3. Three different 
cases are indicated by dots: {a} p = qı = 0; (b) m = 0, qi = —0.3; (c) 
Pi = —0.3, qi = —0.9. 
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Figure 4.11 Simulation of the generalized self-tuning algorithm on the 
system in Example 4.7 when (a) pı = q; = 0 (minimum-variance control); 
(b) pı = 0, qı = —0.3; (c) pı = —0.3, qı = —0.9 (open-loop system). 


The closed-loop system becomes 


1+pi.q 
i} = ———— eit 


8 +8197 
MO = Farag 


There are many different ways to choose the filter Q*/P*. In principle it should 
be a phase-advance network. This implies that the closed-loop system given by 
Eq. (4.33) will be low-pass filtered. Figure 4.10 shows how the output and 
input variances change with q, for some values of pı. Case (a) in Fig. 4.10 
corresponds to minimum-variance control. In case (b) the output variance is 
increased by 10%, and the input variance is reduced by about 60% compared 
with the minimum-variance case. In case (c) the input variance is zero; that is, 
the system is open-loop. Figure 4.11 shows the accumulated losses for the input 
and the output when the generalized self-tuning algorithm is used. Cases (a), 
(b), and (c) are the same as in Fig. 4.10. oO 


EXAMPLE48 Generalized minimum variance self-tuning controller 


The self-tuning controller that minimizes Eq. (4.36) will now be used to control 
the same system as in the previous example. The controller in Eq. (4.37), with 
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Figure 4.12 The output variance V, as a function of the input variance V, 
in Example 4.8 for different values of p: (a) p = 0; (b) p = 4; (c) p = 100. 


R* and S* given by Eq. (4.16), is 


u(t) = e—a yele) 


b+p(l+ag 
Figure 4.12 shows the output variance as a function of the input variance 
for different values of p. The curve has the same gross behavior as shown in 
Fig. 4.10. However, the parameter p may be easier to choose than the filter in 
Example 4.7. Figure 4.13 shows the accumulated losses of the output and the 
input for different values of p when the self-tuner in Algorithm 4.1 is used on 
the augmented system shown in Fig. 4.9. Compare Fig. 4.11. Oo 


Summary 


There are many ways to make direct self-tuning regulators with good prop- 
erties. The amount of computation is moderate, since the design calculations 
are eliminated. It has been shown that the generalized direct self-tuning al- 
gorithm, Algorithm 4.2, is very flexible. By using the filter Q*/P* and the 
prediction horizon, it is possible to determine the behavior of the closed-loop 
system. It is possible to choose, for instance, moving-average control, general- 
ized minimum-variance control, or pole-zero placement control. 

The observer polynomial does not influence the asymptotic properties. It 
will instead influence the transient properties and can be used to improve the 
convergence properties of the algorithm. The robustness and sensitivity of the 
algorithm are also influenced by the filter Q*/P*. 

For simplicity, Algorithm 4.2 has been derived for the regulator case, in 
which the reference value is equal to zero. It is easy to modify the algorithm 
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Figure 4.13 Simulation of the generalized minimum variance self-tuning 
algorithm on the system in Example 4.8 when (a) p = 0 (minimum-variance 
control); (b) p = 4; (c) p = 100 (“almost” open-loop control). 


such that the output follows a reference trajectory; some ideas are suggested 
in the problems at the end of this chapter and in Section 11.3. 


4.5 LINEAR QUADRATIC STR 


The linear quadratic design procedure can also be used as the design method 
in a self-tuning regulator. Consider the process model 


A(g)y(t) = B(g)u(é) + C(g)elt) (4.39) 
and the steady-state loss function 
Jya = E { (E) — Yml + pu” (0) } (4.40) 


The optimal feedback law that minimizes Eq. (4.40) for the system of Eq. (4.39) 
is given by the following theorem. 


THEOREM 43 LQG control 


Consider the system in Eg. (4.39). Let the monic polynomials A(g) and C(g) 
have degree n. Assume that C(q) has all its zeros inside the unit disc, and 
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assume that there is no nontrivial polynomial that divides A(g), B(q), and 
C(q). Let Az2(g) be the greatest common divisor of A(g) and B (q), let A} (q) of 
degree } be the factor of Ag(g) with all its zeros inside the unit disc, and let 
Az (q) of degree m be the factor of A(g) that has all its zeros outside the unit 
disc or on the unit circle. 
The admissible control law that minimizes Eq. (4.40) with p > 0 is then 
given by l 
R(q)u(t) = —S(g)v(t) + T(a)ymn(2) (4.41) 


where R and S are of degree n+ m 


R(a) = Az (4) R(q) 


S(q) = 2"S(@) (442 
and R(g) and Š(q) satisfy the Diophantine equation 
A1(9)Az (@) R(q) + q” Bi(g)S(q) = Pi(g)C(q) (4.43) 


with deg R(q) = deg S(q) = n and (0) = 0. Furthermore, 

A(q) = Ai(q)A2(q) 

B(q) = Bi(q)A2(q) 

B(q) = Bi(g)Az (9) 
The polynomial P(g) is given by 

P(q) = Az (9) Pi(Q) (4.44) 
where P,(q) is the solution of the spectral factorization problem 

rP\(q)Pi(q*) = pAr(a) Az (Ar (9) Ag (9) + Bilg)Bilg") (4.45) 
with deg P;(g) = deg Ai (q) + deg A; (q). The polynomial T'(g) is given by 
T (4) = tog”C(q) 

where 


fo = P,(1)/Bi (1) O 


A proof of the theorem is found in Astrém and Wittenmark (1990). 
Remark. By using Eqs. (4.42) the identity (4.43) can be written as 


A(q)R(q) + B(g)S(q) = A2(@)Pila)C(a) 


The LQG solution can thus be interpreted as a pole-placement controller, where 
the poles are positioned at the zeros of Az, Pi, and C. The controller also has the 
property that Az divides K. This is an example of the internal model principle. 
Using the internal model principle implies that a model of the disturbance is 
included in the controller. O 
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To solve the design problem, it is necessary to solve the spectral factoriza- 
tion problem of Eq. (4.45) and to solve the Diophantine equation Eq. (4.43). 
The solution to the LQG problem given by Theorem 4.3 is closely related to 
the pole placement design problem. The solution to the spectral factorization 
problem gives the desired closed-loop poles. The second part of the algorithm 
can be interpreted as a pole placement problem. 

An alternative solution to the design problem is to use a state space 
formulation. The process model of Eq. (4.39) can be written in state space 
form as _ _ E 

x(t + 1) = Ax(t) + Bu(t) + Ke(t) 
y(t) = Cx(t) + e(t) 


where the matrices A, B, C, and K are given in the canonical form 


(4.46) 


—a1 1 0O ... 0 
A= 

—a,-; 0 .-. 1 

—ün 0 wee 0 
B=(0 ... O bo ... bm)? 
C=(1 0... 0) 
K = (i-a ... Cn — an) 


where m = n—dy. The model in Eq. (4.46) is called the innovation model, and K 
is the optimal steady-state gain in the Kalman filter, that is, £(£+1|f) = x(¢+1). 
It is also possible to derive the filter for £(#|t), which is given by 


alele) = (ql -A + KO) (Bult) + Ky(2)) 


By using the definitions of A, K, and Č it is easily seen that det(gi—A+KC) = 
C(q). That is, the optimal observer polynomial is equal to C(q). 
Introduce the loss function 


N 
J =E [roe + pu? (t) + (aoe (4.47) 


#=1 


The optimal controller is given by 
u(t) = —L(e)£(e|t) (4.48) 
where L(t} is a time-varying feedback gain given through a Riccati equation 
S(t) = (A- ÈLA) S(t + 1)(A- BL) + Qi + PLT (OL 
L(t) = {p + B7S(¢+1)B)” BT S(t + 1A 
with S({N) = Qo. The limiting controller 
L = lim L(t) 


(4.49) 
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is such that the closed-loop characteristic equation is 
P(q) = det(q —A+ BL) =0 


where P(q) is the same as in Eq. (4.44). 

The two solutions to the LQG control problem suggest two ways to construct 
indirect linear quadratic self-tuning regulators. In both algorithms it is first 
necessary to estimate the A, B, and C polynomials in the process model of 
Eq. (4.39). This can be done by using the recursive maximum-likelihood method 
or the extended least-squares method. This leads to the following algorithm. 


ALGORITHM 43 Indirect LQG-STR based on spectral factorization 


Data: Given specifications in the form of the parameter p in the loss function 
of Eq. (4.40) and the order of the system. 


Step 1: Estimate the coefficients of the polynomials A, B, and C in Eq. (4.39). 


Step 2: Replace A, B, and C with the estimates obtained in Step 1 and solve 
the spectral factorization problem of Eq. (4.45) to obtain P{q). 


Step 3: Solve the Diophantine equation of Eq. (4.48). 

Step 4: Calculate the control signal from Eq. (4.41). 

Repeat Steps 1, 2, 3, and 4 at each sampling period. m 
The state space formulation gives the following algorithm. 


ALGORITHM 4.4 Indirect LQG-STR based on the Riccati equation 


Data: Given specifications in the form of the parameters Qo, Q1, and p in the 
loss function of Eq. (4.47) and the order of the system. 


Step1: Estimate the coefficients of the polynomials A, B, and C in Eq. (4.39). 


Step 2: Replace A, B, and C with the estimates obtained in Step 1 and solve 
the algebraic Riccati equation or iterate Eqs. (4.49) to obtain L. 


Step 3: Calculate the control signal from Eq. (4.48). 
Repeat Steps 1, 2, and 3 at each sampling period. Oo 


Notice that if Q; = C’C, the steady-state solution to Eqs. (4.49) will give 
the same result as the minimization of Eq. (4.40}. Algorithms 4.3 and 4.4 are 
indirect algorithms that are able to handle nonminimum-phase systems and 
varying time delays. The computations are more extensive for these algorithms 
than for the simple self-tuning regulators discussed above. 

Solution of the spectral factorization or the Riccati equation is the major 
computation in an LQG self-tuner. These calculations can be made in many 
different ways. The Riccati equation can be solved by using an eigenvalue 
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method or by some iterative method. The iterative methods will in general 
lead to shorter code. In general the Riccati equation is iterated several steps. To 
guarantee that the calculations can be done in a prescribed sampling interval, 
it is necessary to truncate the iterations; it is important that a reasonable 
result be obtained when the iteration is truncated. For instance, the polynomial 
P in the spectral factorization must be stable. This is guaranteed for some 
algorithms. In some algorithms it is suggested that the Riccati equation be 
iterated only one step at each sampling. 


46 ADAPTIVE PREDICTIVE CONTROL 


Algorithm 4.1 is one way to make a controller with a variable prediction 
horizon. The underlying control problem is the moving-average controller, The 
moving-average controller may also be used for nonminimum-phase systems, 
as was illustrated in Section 4.3. 

In using the minimum-variance controller or the moving-average controller 
the output is predicted only at one future time. The prediction horizon d is then 
a design parameter. The predicted output can also be computed for different 
prediction horizons and then used in a loss function. Several ways to achieve 
predictive control have been suggested in the literature; we now discuss and 
analyze some of these. The case with known parameters is first analyzed before 
the adaptive versions are discussed. 

Predictive control algorithms are based on an assumed model of the process 
and on an assumed scenario for the future control signals. This gives a sequence 
of control signals. Only the first one is applied to the process, and a new se- 
quence of control signals is calculated when a new measurement is obtained. 
This is called a receding-horizon controller. There are many variants of pre- 
dictive control, for instance, model predictive control, dynamic matrix control, 
generalized predictive control, and extended horizon control. The methodology 
has been used extensively in chemical process control. 


Output Prediction 


One basic idea in the predictive control algorithms is to rewrite the process 
model to get an explicit expression for the output at a future time. Compare 
Eq. (4.22). Consider the deterministic process 


A*(q-*)y(é) = B*(q7*)u(t — do) (4.50) 
and introduce the identity 


1 = A*(q) F(a") + 9 GAl) (4.51) 
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where 
deg Fi =d—1 
deg G5 =n—1 
The subscript d is used to indicate that the prediction horizon is d steps. It 


is assumed that d > do. The polynomial identity of Eq. (4. 51) can be used to 
predict the output d steps ahead. Hence 


y(t + d) = A*Fyy(t + d) + Gyy(t) = B*Fiu(t +d— do) + Ggy(t) (4-52) 
Compare Eq. (4.12). Introduce 
BY (q) Fi) = Rala) + RG) 
where 
deg R} = d — dh 
deg R} =n —2 


The coefficients of R} are the first d — dy + 1 terms of the pulse response of 
the open-loop system. This can be seen as follows: 


G* 
— ph Re -dTa 
ae TB (r; +q i) 
-n en pee lh, BT GH) i 
= g7*R* 1) 4 (d+1) fe 1 + d (d+do) 
q”Ra +a alg) Ae) 
The powers of the last two terms are at least —(d +1). It then follows that R} 
is the first part of the pulse response, since deg R} = d — dp. 
Equation (4.52) can be written as 


ot + d) = Ri(q u(t +d — do) + RY(q ult — 1) + GG) 
= Ri (q7)u(t +d — do) + Falt) (4.54) 


Rt (q7*)u(t +d—do) depends on u(t),...,u(é-+d—do), and Ja(£) is a function of 
u(t—1), u(t—2), ..., and y(t), y(t— 1), .... The variable z(t) can be interpreted 
as the constrained prediction of y(t + d) under the assumption that u(t) and 
future control signals are zero. The output at time ¢+d thus depends on future 
control signals (if d > dp}, the control signal to be chosen, and old inputs and 
outputs. If d > do, it is necessary to make some assumptions about the future 
control signals. One possibility is to assume that the control signal will remain 
constant, that is, that 


u(t) =u(t+ 1} =- = u(t + d — do) (4.55) 


Another way is to determine the control law that brings y(t + d) to a desired 
value while minimizing the control effort over the prediction horizon, that is, 
to minimize 


(4.53) 


t+d 


Suk)? (4.56) 


k=t 
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A third way is to assume that the increment of the control signal will be zero 
after some time. This is used, for instance, in generalized predictive control 
(GPC), which is discussed below. 


Constant Future Control 


Consider Eq. (4.54) and assume that the predicted output is equal to the 
desired output, that is, y(t + d) = y(t +d). If we assume that Eq. (4.55) 
holds, then u(t) should be chosen such that 


Ym(t +d) = (R3 (1) + g` Ra(7")) u(t) + Galaya) 


This gives the control law 


Ym(t + d) — Gq") (4) 
Ra (1) + Rala) 


This control signal is then applied to the process. At the next sampling instant 
a new measurement is obtained, and the control law of Eq. (4.57) is used again. 
Note that the value of the control signal is changed rather than kept constant, 
as was assumed when Eq. (4.57) was derived. The receding-horizen control 
principle is thus used. Note that the control law is time-invariant, in contrast 
to a fixed-horizon linear quadratic controller. 

We now analyze the closed-loop system when Eq. (4.57) is used to control 
the process of Eq. (4.50). It is now necessary to make the calculations in the 
forward shift operator, since poles at the origin may otherwise be overlooked. 
The identity of Eq. (4.51) can be written in the forward shift operator as 


u(t) = (4.57) 


qti = A(q)Fa(q) + Gala) (4.58) 
The characteristic polynomial of the closed-loop system is 
P(q) = A(q) (q7"Ra(1) + Ra(g)) + Gala)}B (a) (4.59) 


where 
deg P= degA+n—-1=2n-1 


The design equation (Eq. 4.58) can now be used to rewrite P(q): 
B(q)q"*** = A(q)B(q)Fa(g) + Gala)B (a) 
= A(q) (q> Rala) + Ra(q)) + Gala) Bla) 
Hence E 
A(q)Ra(q) + Ga(q)B(q) = B(g)q"*** — Ala)a > Rala) 
which gives 


P(g) = T 1A(q)Ra(1) +g”! (B (a) — A(q)Ra(q)) 


4.6 Adaptive Predictive Control i71 
If the process is stable, it follows from Eq. (4.53) that the last term vanishes 
as d -» co. Thus 

lim P(qg)=q" ‘A(q)Ra(1) if A(z) is a stable polynomial 
The properties of the predictive control law are illustrated by an example. 
EXAMPLE49 = Predictive control 
Consider the process model 
yli + 1) + ay(t) = bu(t) 

The identity of Eq. (4.58) gives 


g? = (g+a)(q*" + fag +---+ for) + 90 


Hence 
F(q) = gt} = ag? 4 agi? AE (—a)™ 
G(q) = (~a)? 
Rala) = bF (q) 
Ra(q) =0 
and the control law becomes, when ym = 0, 
(-a)? (~a) (1 +a) 


MO aa... ayy aa a 


The characteristic polynomial of the closed-loop system is 


P(qg)=qtat (ax) a Ci ) 
which has the pole 
__a+(—a) 
“T= 


If a < 0 the location of the pole is given by 


0 < pa < —a la| <1 (stable open-loop system) 
0<pa<1 ja|>1 (unstable open-loop system) 


The closed-loop pole for different values of a and d is shown in Fig. 4.14. The 
example indicates that it can be sufficient to use a prediction horizon of five 
to ten samples. m 


It is possible to generalize the result of Example 4.9 to higher-order sys- 
tems. The conclusion is that the closed-loop response will be slow for slow 
or unstable systems when the prediction horizon increases. The restriction of 
Eq. (4.55) is then not very useful. 
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Figure 4.14 The closed-loop pole pg = (a* — a)/ (af — 1) as function of d for 
different values of a. 


Minimum Control Effort 


The control strategy that brings y(t+d} to y,,(¢+d) while minimizing Eq. (4.56) 
will now be derived, Equation (4.54) is 


y(t + d) = R3(q )ju(t + d — do) + Falt) 
= rgou(t + v) +--+ ravut (t) + Falt) 
where v = d — dp. The condition 
y(t + d) = Yym(t + d) = Ja(t) + Ra(g u(t +d — do) 
can be regarded as a constraint while minimizing Eq. (4.56). Introducing the 
Lagrangian multiplier A gives the loss function 
2J = u(t)’ +--+ u(t +V} + 2A (Ym(t + d) — Falt) — Ro(q “u(t + v)) 


Equating the partial derivatives with respect to u(#),---,u(fé+v) and A to zero 
gives 


u(t +v) = Argo 
¥m(t + d} — a(t} = raou(t + Vv) +---4+rau(é) 
This set of equations gives 
ue) = Int +d) ~ alt 
H 
where 
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Using the definition of ¥,(t) gives 
u(t) = ym(t + d) — Ryu(t — 1) — Gay(t) 
or 
TE Ym(t +d) — Gay(t) _ m(t +d+n -— 1) - Gala) 
utg Ry ug"! + Rala) 


Using Eq. (4.60) and the model of Eq. (4.50) gives the closed-loop characteristic 
polynomial 


(4.60) 


P(q) = A(q) (u + Ra(q)) + Gala)B (a) 


This is of the same form as Eq. (4.56), with Ra(1) replaced by u. This implies 
that the closed-loop poles approach the zeros of g’'A(g) when A(q)} is stable 
and when d — co. What will happen when the open-loop system is unstable? 
Consider the following example. 


EXAMPLE 4.10 Minimum-effort control 
Consider the same system as in Example 4.9. The minimum-effort controller 
is in this case given by 
7 pl +a? +- +a) ě b(a” —1) 
ai Cami aai) 


which gives (when ym = 0) 


u(t) =- EE 9) = IE see 
The pole of the closed-loop system is 
a?i- (a? — 1) alg 
MERET eee. aE: 
which gives 
iim pad = =a ja|<1 (stable open-loop system) 
lim pa =—Il/a |a|>1 (unstable open-loop system) 


For this example the minimum-effort controller gives a better closed-loop sys- 
tem than if the future control is assumed to be constant. m 


Generalized Predictive Control (GPC) 


The predictive controllers discussed so far have considered the output at only 
one future instant of time. Different generalizations of predictive control bave 
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been suggested, in which different loss functions are minimized. One possibility 
is to use 


Na Nu 
J(Ni,Neo,Nu) =E S(t +k) —9m(t + ky)? + S| pAu(t +k— ur (4.61) 
k=Ny k=1 


where 
A=1-—q! 

is the difference operator. Different choices of Ni, No, and N, give rise to the 
different schemes suggested in the literature. 

The methodology of generalized predictive control is illustrated by using 
the loss function of Eq. (4.61) and the process model 
tfo m fi e(t 
A(T = Bul — do) + (4.62) 
This model is sometimes called the CARIMA (controlled auto-regressive inte- 
grating moving-average} model. It has the advantage that the controller will 
automatically contain an integrator. (Compare Section 3.6.) As with Eq. (4.50), 
the following identity is introduced: 


1 = A* (g> Fi AI- 47") + atala) (4.63) 
This can be used to determine the output d steps ahead: 
y(t + d) = F3 B* Au(t + d — do) + Gyy(t) + Fge(t +d) 


F% is of degree d—1. The optimal mean squared error predictor, given measured 
output up to time £ and any given input sequence, is 


S(t +d) = FB* Au(t +d — do) + Gi(t) (4.64) 


Suppose that the future desired outputs, Ym(t + 2), k = 1, 2,... are available. 
The loss function of Eq. (4.61) can now be minimized, giving a sequence of 
future control signals. Notice that the expectation in Eq. (4.61) is made with 
respect to data obtained up to time ¢, assuming that no future measurements 
are available. That is, it is assumed that the computed control sequence is 
applied to the system. However, only the first element of the control sequence is 
used. The calculations are repeated when a new measurement is obtained. The 
resulting controller belongs to the confusingly named class called open-loop- 
optimal-feedback control. As the name suggests, it is assumed that feedback is 
used, but it is computed only on the basis of the information available at the 
present time. 
In analogy with Eq. (4.54) we get 


y(t +1) = Ril )Au(t + 1 —do) + A(t) + Fielt + 1) 
y(t +2) = R3(q*)Au(t + 2 — do) + Fo(t) + Felt + 2) 


y(t +N) = R(T Ault + N — do) + F(t) + Felt + N) 
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Each output value depends on future control signals (if d > dp), measured 
inputs, and future noise signals. The equations above can be written as 


y= RAu+7+e 


where 
y=(x(t+1) ... yt+N))" 
Au=(Au(t+1—d)) ... Au(t+N~-d)))* 
¥=(nlt) ... Fn(t))* 
e=(Fte(t+1) ... Fre(t+N))* 


From Eq. (4.53) it follows that the coefficients of R$ are the first d — dọ + 1 
terms of the pulse response of g~“°B*/(A*A), which are the same as the first 
d — do + 1 terms of the step response of g~“°B*/A*. The matrix R is thus a 
lower triangular matrix: 


ro 0 0 
Py ra 0 

R= 
FN-1 FN-2 «--. Fo 


If there is a dead time in the system, do > 1, then the first do — 1 rows of R 
will be zero, Also introduce 


Ym = (Ym(E+1) --- Ym(E+N))" 
The expected value of the loss function can be written as 
J(L, N, N) = E{(y — Ym)” (Y — Ym) + PAu" Au} 
= (RAu +9 — ym) (Râu +9—ym) + pAu"Au (4.65) 
Minimization of this expression with respect to Au gives 


Au = (RTR + pI) ’R? (Ym —9) (4.66) 


The first component in Au is Au(é), which is the control signal applied to 
the system. Notice that the controller automatically has an integrator. This is 
necessary to compensate for the drifting noise term in Eq. (4.62). 

Notice that R is independent of the measurements and the old control 
signals. Only y and ¥ depend on the measurements. The controller (4.66) is 
thus a time-invariant controller if the process is time-invariant. The predictive 
controller can thus be interpreted in terms of a pole placement controller. For 
instance, NV, = Ny =n+1, Ne > 2(n+1)—1, and p = 0 leads to a deadbeat 
controller. 

The calculation of Eq. (4.66) involves the inversion of an N x N matrix, 
where N is the prediction horizon in the loss function. To decrease the compu- 
tations, it is possible to introduce constraints on the future control signals. For 
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instance, it can be assumed that the control increments are zero after N, < N 
steps: 

Au(Zi+k-1)=0 k>N, 
This implies that the control signal is assumed to be constant after N, steps. 


Compare the constraint of Eq. (4.55). The control law (Eq. 4.66) then changes 
to 


Au = (Ri’R, + pI) *Ri? (ym — F) (4.67) 
where Ra is the N x N, matrix 
fo 0 nna 0 
ry To wee 0 
Ry, = 
ro 
FN-1 TN-2 --- TN-N, 


The matrix to be inverted is now of order N, x Nu. 

One advantage of the receding horizon controllers is that it is possible to 
include constraints in the states and the control signal. References to this are 
given at the end of the chapter. One disadvantage with the GPC is that there 
are many parameters to determine, and it is not obvious how to choose the 
parameters to get a stable closed-loop system. 

The output and control horizons can be chosen as follows: The lower limit 
N in Eq. (4.61) indicates the first output that will be used in the Joss function. 
The first output that is influenced by u(t) is y(t+do). If the time delay is known, 
then N = do is the obvious choice. When the time delay is unknown, Ni = 1 
or Ny = domin could be used, where domin is an estimate of the lower limit of the 
delay. For unknown delays the order of the B polynomial should be increased 
to make it possible to include all possible values of dp. This will make the 
adaptive GPC quite insensitive to variations in the time delay. 

The maximum output horizon Ne can be chosen such that N2h is of the 
same magnitude as the rise time of the plant, where A is the sampling time 
of the controller. If the system is nonminimum phase, then Ne should be 
chosen such that Nz exceeds the degree of the B polynomial. This will imply 
that the maximum output horizon is longer than a possible negative-going 
nonminimum-phase transient. 

The contro! horizon N, is an important design parameter. As a rule, N, 
should be longer the more complex the process is. For processes that are 
unstable or close to the stability boundary it is necessary to use a N, that 
is at least equal to the number of unstable or poorly damped poles. For simpler 
processes, Ny, = 1 often gives good results. 

To make the generalized predictive controller adaptive, it is necessary 
at each step of time to estimate the A* and &* polynomials. The predicted 
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values for different prediction horizons are computed, and the control signal 
is calculated from Eq. (4.67). The adaptive generalized predictive controller 
is thus an indirect control algorithm. The predictions of Eq. (4.64) can be 
computed recursively, which will simplify the computations. Finally, N,, is 
usually chosen to be small, which implies that only a low-order matrix needs 
to be inverted. The adaptive version of GPC has shown good performance and 
a certain degree of robustness with respect to the choice of model order and 
poorly known time delays. 

To investigate the closed-loop properties of the system in using GPC, we 
first determine the control signal Au(t) from Eq. (4.67): 


Au(t)=(1 0 ... 0) (Ra? Ry + pl) Ra? (¥m—J) 
= (a wae an ) (ym — F) 
Further, from Eq. (4.62), using Eq. (4.54), 


E SA'A | 
Rt Au(t — 1) + Gty(t) go G 


tety 
Il 
| 


: y(t) 
Ry Au(t — 1) + G4 y(t) Rapata A j- “1 


The closed-loop system has the characteristic cnation 

Ri A*Aqg*® 14+ B*Gs 

A*A+(@ ... Gy) : 

Ry A*A g + B*Gs, 

The identity of Eq. (4.63) gives 
B* = A*AB*F34+q°G)B" 
= A*A(R§ + q DR) + 4 G5 B" 
This gives the characteristic equation 
(BY — A*AR})q 
A*A+(@, ... QN) 


(B* - AAR wg 
N 
= A*A +Y aig'(B* — A"AR?) (4.68) 
i=] 
Equation (4.68) gives an expression for the closed-loop characteristic equation, 
but it is still difficult to draw any general conclusions about the properties of 
the closed-loop system even when the process is known. 
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If N,, = 1, then 
ri 


=~ SN 
P+ yah 


If p is sufficiently large, the closed-loop system becomes unstable if the open- 
loop process is unstable. However, if both the control and output horizons are 
increased, the problem is the same as a finite-horizon linear quadratic control 
problem and should thus have better stability properties. 

The model predictive controllers such as GPC have the drawback that there 
are many parameters to choose. Even if there are rules of thumb for choosing 
the parameters, it is sometimes difficult to determine the parameters such that 
the closed-loop system is stable (see Problem 4.12), This difficulty exists both 
when the process is known and when an adaptive GPC algorithm is used. 

It is easily seen that the GPC control problem can be interpreted as a 
stationary LQG control problem but with time-varying weighting matrices or 
as a finite horizon LQG problem. Compare the joss functions (4.47) and (4.61). 
The stationary LQG problem and the associated Riccati equation have been 
extensively studied, and there is much knowledge about the properties of the 
closed-loop system. The drawback of the infinite-horizon LQG formulation is 
that it cannot handle constraints in the states or the control signal. Different 
ways to formulate and solve the constrained receding horizon problem are given 
in the references at the end of the chapter. 


Qi 


4.7 CONCLUSIONS 


This chapter has reviewed different self-tuning regulators. The basic idea is to 
make a separation between the estimation of the unknown parameters of the 
process and the design of the controller. The estimated parameters are assumed 
to be equal to the true parameters in making the design of the controller. It is 
sometimes of interest to include the uncertainties of the parameter estimates in 
the design. Such controllers are discussed in Chapter 7. By combining different 
estimation schemes and design methods, it is possible to derive self-tuners with 
different properties. In this chapter, only the basic ideas and the asymptotic 
properties are discussed. The convergence of the estimates and the stability of 
the closed-loop system are discussed in Chapter 6. 

The most important aspect of self-tuning regulators is the issue of param- 
eterization. A reparameterization can be achieved by using the process model 
and the desired closed-loop response. The goal of the reparameterization is to 
make a direct estimation of the controller parameters, which usually implies 
that the new model should be linear in the controller parameters. 

Only a few of the proposed self-tuning algorithms have been treated in 
this chapter. Different combinations of estimation methods and underlying 
control problems give algorithms with different properties. One goal of the 
chapter has been to give a feel for how self-tuning algorithms can be developed 
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and analyzed. It is important that the desired closed-loop specifications are 
carefully chosen in applying a self-tuner. A design method that is unsuitable 
when the process is known will not become better when the process is unknown. 

It is also possible to derive self-tuning regulators for multi-input, multi- 
output (MIMO) systems. The MIMO case is more difficult to analyze. One main 
difficulty is to define what the necessary a priori knowledge is in the MIMO 
case, It is quite straightforward to derive a self-tuning algorithm corresponding 
to the generalized direct self-tuning regulator for the restricted case when the 
delays between the different inputs and outputs are known. 


PROBLEMS 


4.1 Consider the process and controller in Example 4.4. The controller pa- 
rameter §) may be very large if b is small. Discuss alternatives to ensure 
that the controller parameter stays bounded. 


4.2 Consider the basic direct self-tuning controller in Algorithm 4.1. Discuss 
different ways to incorporate reference values in the controller. What are 
the properties of the following three ways for taking care of the reference 
value? 

(a) Use the difference y—u, instead of y in the algorithm, and introduce 
an integrator in the controller. 
(b) Estimate the parameters using the model 


y(t +d) = Rut+S*y—T*u, +E 
and let the controller be 
R'u = —S*y + T'ue 
(c) Use the difference ue — y instead of y in the algorithm, and introduce 
an integrator in the controller, 
4.3 Show that the control equation (4.37) minimizes the loss function (4.36). 


4.4 Consider the system in Example 4.6. Assume that the process is known. 
Compute the optimal minimum-variance controller and the least attain- 
able output variance when (a) t = 0.4 (the minimum-phase case) and (b) 
T = 0.6 (the nonminimum-phase case). (Hint: Use Theorem 4.3 for the 
nonminimum-phase case.) 


4.5 Make the same calculations as in Problem 4.4 but for the moving-average 
controller with d = 2. 


4.6 Consider the generalized minimum-variance controller of Eq. (4.37). 
Compute the closed-loop characteristic equation. Discuss when the de- 
sign method may give an unstable closed-loop system. For instance, is it 
useful for the process in Example 4.6 when t = 0.6? 


180 


4.7 


4.9 


4.10 
4.11 


4.12 


4,13 


4.14 
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Consider the process in Example 4.6 when r = 0.6 and C = 0, Use 
Eq. (4.67) to compute the closed-loop poles for different-values of N when 
Nu = 1. 

Show that the moving-average controller with Bt* = 1 and d = n corre- 
sponds to a state deadbeat controller. 


Consider the process in Example 4.3. Assume that 


Alq) = £ — 1.5q + 0.7 

B(q) =q +b; 

Cla) =g -—qa+0.2 
Determine the variance of the closed-loop system as a function of bı 
when the moving-average controller is used. Compare with the lowest 
achievable variance. 
Show that the control law (4.66) minimizes the loss function (4.65). 


Consider the process in Example 4.5. Investigate through simulation what 
values of fp can be used. Make the simulations with and without bounds 
on the contro] signal. How sensitive is the choice of initial values in the 
algorithm? 
Consider the system (4.62) with e(t) = 0 and 
A*(q) =] —4q7} +4q72 = (1 _ 2q +)? 
B* (q>) = q! — 1.9999? 
The open-loop process is unstable, and there is a near pole-zero can- 
cellation. Assume that p = 0.1 and compute the generalized predictive 
controller that minimizes Eq. (4.65) for different values of N. How large 
must N be to get a stable closed-loop system? (The problem is adopted 
from Bitmead et al. (1990).) (Hint: Don’t give up until N > 25.) 
Consider the system in Problem 1.9. 
(a) Sample the system and assume that e is discrete-time measurement 
noise. Determine the minimum-variance controller for the system. 
(b) Simulate a self-tuning moving-average controlier for different predic- 
tion horizons. 


Make the same investigation as in Problem 4.12 but for the process in 
Problem 1.10. 
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CHAPTER 5 


MODEL-REFERENCE 
ADAPTIVE SYSTEMS 


5.1 INTRODUCTION 


The modei-reference adaptive system (MRAS) is an important adaptive con- 
troller. It may be regarded as an adaptive servo system in which the desired 
performance is expressed in terms of a reference model, which gives the desired 
response to a command signal. This is a convenient way to give specifications 
for a servo problem. A block diagram of the system is shown in Fig. 5.1. The 
system has an ordinary feedback loop composed of the process and the con- 
troller and another feedback loop that changes the controller parameters. The 
parameters are changed on the basis of feedback from the error, which is the 


Figure 5.1 Block diagram of a model-reference adaptive system (MRAS). 
185 


186 Chapter 5 Model-Reference Adaptive Systems 


difference between the output of the system and the output of the reference 
model. The ordinary feedback loop is called the inner loop, and the parameter 
adjustment loop is called the outer loop. The mechanism for adjusting the pa- 
rameters in a model-reference adaptive system can be obtained in two ways: 
by using a gradient method or by applying stability theory. 

In the MRAS the desired behavior of the system is specified by a model, 
and the parameters of the controller are adjusted based on the error, which 
is the difference between the outputs of the closed-loop system and the model. 
Model-reference adaptive systems were originally derived for deterministic 
continuous-time systems. Extensions to discrete-time systems and systems 
with stochastic disturbances were given later. 

The presentation in this chapter follows the historical development. The 
MIT rule is derived in Section 5.2. This rule has one parameter, the adapta- 
tion gain, that must be chosen by the user. In Section 5.3 we discuss methods 
to determine the adaptation gain. Section 5.4 presents Lyapunov’s stability 
theory, and Section 5.5 shows how this theory can be used to derive stable 
adaptation laws. These laws are similar to those obtained by the MIT rule. 
In Section 5.6 we introduce the theory for input-output stability. This gives 
another way of viewing adaptive control systems, which is presented in Sec- 
tion 5.7. In Section 5.8 we show how MRASs can be obtained for output feedback 
of general linear systems. Section 5.9 gives a comparison between self-tuning 
regulators and MRASs. Adaptive control of nonlinear systems is briefly dis- 
cussed in Section 5.10. The chapter is summarized in Section 5.11. Further 
insight into model reference adaptive systems is given in Chapter 6. 


5.2 THE MIT RULE 


The MIT rule is the original approach to model-reference adaptive control. The 
name is derived from the fact that it was developed at the Instrumentation 
Laboratory (now the Draper Laboratory) at MIT. 

To present the MIT rule, we will consider a closed-loop system in which the 
controller has one adjustable parameter 8. The desired closed-loop response is 
specified by a model whose output is y,,. Let e be the error between the output 
y of the closed-loop system and the output ym of the model. One possibility is 
to adjust parameters in such a way that the loss function 


J(0) = 5¢ (5.1) 


is minimized. To make J small, it is reasonable to change the parameters in 
the direction of the negative gradient of J, that is, 


dé aJ de 


oF _ __,, 5.2 
de’ ae °° a0 (5.2) 
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This is the celebrated MIT rule. The partial derivative de/90, which is called 
the sensitivity derivative of the system, tells how the error is influenced by 
the adjustable parameter, If it is assumed that the parameter changes are 
slower than the other variables in the system, then the derivative 0e/0@ can 
be evaluated under the assumption that @ is constant. 

There are many alternatives to the loss function given by Eq. (5.1). If it is 
chosen to be 

J(@) = lel (5.3) 
the gradient method gives 
d 

at = —y 7 signe (5.4) 
The first MRAS that was implemented was based on this formula. There are, 
however, many other possibilities, for example, 


& = y sign ( & ) sign(e) 

gp PE age 

This is called the sign-sign algorithm. A discrete-time version of this algorithm 
is used in telecommunications, in which simple implementation and fast com- 
putations are required. (See Section 13.2.) 


Adjusting many parameters Equation (5.2) also applies when there are many 
parameters to adjust. The symbol @ should then be interpreted as a vector and 
6e/@ as the gradient of the error with respect to the parameters. 


Examples 


We now give two examples that illustrate how the MIT rule is used to obtain 
a simple adaptive controller, and we also show some properties of adaptive 
systems. 


EXAMPLE 5.1 Adaptation of a feedforward gain 


Consider the problem of adjusting a feedforward gain. In this problem it is 
assumed that the process is linear with the transfer function kG(s), where G(s) 
is known and & is an unknown parameter. The underlying design problem is 
to find a feedforward controller that gives a system with the transfer function 
Gn(s) = koG (s), where kp is a given constant. With the feedforward controller 


u = Ou, 


where u is the control signal and u, the command signal, the transfer function 
from command signal to the output becomes @kG(s). This transfer function is 
equal to G,,(s) if the parameter @ is chosen to be 


ko 
E 
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Model 


Figure 5.2 Block diagram of an MRAS for adjustment of a feedforward gain 
based on the MIT rule. 


We will now use the MIT rule to obtain a method for adjusting the parameter 
8 when k is not known. The error is 


Ee =Y Ym = kG(p)@u, — koG (p}te 


where u, is the command signal, Ym is the model output, y is the process output, 
Ə is the adjustable parameter, and p = d/d? is the differential operator. The 
sensitivity derivative is given by 


de k 
26 = kG(p)u. = ka?” 
The MIT rule then gives the following adaptation law: 
dð Ik 
TE =—y Ro ¥mne = —V¥me (5.5) 


where y = 7'k/ko has been introduced instead of y’. Notice that to have the 
correct sign of y, it is necessary to know the sign of k. Equation (5.5) gives 
the law for adjusting the parameter. A block diagram of the system is shown 
in Fig. 5.2. 

The properties of the system can be illustrated by simulation. Figure 5.3 
shows a simulation when the system has the transfer function 


G(s) = 
(s) s+1 

The input u, is a sinusoid with frequency 1 rad/s, and the parameter values 
are k = 1 and kp = 2. Figure 5.3 shows that the parameter converges toward 
the correct value reasonably fast when the adaptation gain is y = 1 and that 
the process output approaches the model output. Figure 5.3 also shows that 
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Figure 5.3 Simulation of an MRAS for adjusting a feedforward gain. The 
process (solid line) and the model (dashed line) outputs are shown in the 
upper graph for y = 1, The controller parameter is shown in the lower graph 
when the adaptation gain y has the values 0.5, 1, and 2. 


the convergence rate depends on the adaptation gain. It is thus important 
to know a reasonable value of this parameter. Intuitively, we may expect that 
parameters converge slowly for small y and that the convergence rate increases 
with y. Simulation experiments indicate that this is true for small values of y 
but also that the behavior is quite unpredictable for large y. oO 


An example of a practical problem that fits this formulation is control of 
robots with unknown load, in which the process transfer function from the 
motor current to the angular velocity is 


k 
G(s) = z 


where ky; is the current to torque constant and J is the unknown moment 
of inertia. Another example is the dynamics of a CD player, in which the 
sensitivity of the laser diode is the unknown process parameter. 


A remark on notation In analyzing the MRAS with time-varying parameters 
it is important to consider the fact that the parameter © is time-varying. The 
expression 

G(p) (6x) 


where p = d/dt is the differential operator should be interpreted as the 
differential operator G(p) acting on the signal ĝu. When @ is time-varying, 
this is different from @G(p)u. For example, if G(p) = p, we have 


G(p)(@u) = = p(@u) = 8 a + i u = O(pu) + u{pê) 


Care must thus be taken in manipulating expressions and block diagrams. 
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Notice that no approximations were needed in Example 5.1. When the 
MIT rule is applied to more complicated problems, however, it is necessary 
to use approximations to obtain the sensitivity derivatives. This is illustrated 
by another example. 


EXAMPLE 5.2 MRAS for a first-order system 
Consider a system described by the model 


dy 
dp ay + bu (5.6) 
where u is the control variable and y is the measured output. Assume that we 


want to obtain a closed-loop system described by 


Dm _ 
“dt = —Om¥m + Dmte 
Let the controller be given by 
u(t) = 81u. (t) — 627(¢) (5.7) 
The controller has two parameters. If they are chosen to be 
8 = 80 = bm 
aa (5.8) 


the input-output relations of the system and the model are the same. This is 
called perfect model-following. 
To apply the MIT rule, introduce the error 


E = Y — Ym 
where y denotes the output of the closed-loop system. It follows from Eqs. (5.6) 


and (5.7) that 
be, 


= — ——_ 

ptat bee 
where p = d/dt is the differential operator. The notation used is discussed in 
Section 1.5. The sensitivity derivatives are obtained by taking partial deriva- 
tives with respect to the controller parameters 8, and z: 


y 


bæ è , 
06; p+a+ be j 
e ___ Pa oa Ë 
ôd (ptatbea)e © ptat+ber~ 


These formulas cannot be used directly because the process parameters a and 
b are not known. Approximations are therefore required. One possible approx- 
imation is based on the observation that p+ a + bé$ = p + am when the 
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Figure 5.4 Block diagram of a model-reference controller for a first-order 
process. 


parameters give perfect model-following. We will therefore use the approxima- 
tion 
Ppta+bé2~pt+am 


which will be reasonable when parameters are close to their correct values. 
With this approximation we get the following equations for updating the con- 
troller parameters: 


(5.9) 


In these equations we have combined parameters b and am with the adaptation 
gain y’, since they appear as the product y’b/a,,. The sign of parameter b 
must be known to have the correct sign of y. Notice that the filter has also 
been normalized so that its steady-state gain is unity. 

The adaptive controller is a dynamical system with five state variables 
that can be chosen to be the model output, the parameters, and the sensitivity 
derivatives. A block diagram of the system is shown in Fig. 5.4. The behavior 
of the system is now illustrated by a simulation. The parameters are chosen 
to be a = 1, b — 0.5, and a,, = bm = 2, the input signal is a square wave with 
amplitude 1, and y = 1. Figure 5.5 shows the results of a simulation. Figure 5.6 
shows the parameter estimates for different values of the adaptation gain y. 
Notice that the parameters change most when the command signal changes 
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Figure 5.5 Simulation of the system in Example 5.2 using an MRAS. The 
parameter values area = 1, b = 0.5, a, = bm = 2, and y = 1. 


and that the parameters converge very slowly. For y = 1, the value used in 
Fig. 5.5, the parameters have the values 6, = 3.2 and 6, = 1.2 at time ¢ = 100. 
These values are far from the correct values 0? = 4 and 68 = 2. However, 
the parameters will converge to the true values with increasing time. The 
convergence rate increases with increasing y, as is shown in Fig. 5.6. 

The fact that the control is quite good even at time ¢ = 10 is a reflection of 
the fact that the parameter estimates are related to each other in a very special 
way, although they are quite far from their true values. This is illustrated in 
Fig. 5.7, where parameter @2 is plotted as a function of 6; for a simulation with 
a duration of 500 time units. Figure 5.7 shows that parameters do indeed ap- 
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Time 


Figure 5.6 Controller parameters 6, and 62 for the system in Example 5.2 
when y = 0.2, 1 and 5. 
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a, 
Figure 5.7 Relation between controller parameters 6, and 8, when the 
system in Example 5.2 is simulated for 500 time units. The dashed line shows 
the line 6, = @, — a/b. The dot indicates the convergence point. 


proach their correct values as time increases. The parameter estimates quickly 
approach the line 6) = @, -- a/b. This line represents parameter values such 
that the closed-loop system has correct steady-state gain. D 


Error and Parameter Convergence 


The goal in model-reference adaptive systems is to drive the error e = Y — Ym to 
zero. This does not necessarily imply that the controller parameters approach 
their correct values, as is illustrated in the following example. 


EXAMPLE 5.3 Lack of parameter convergence 
Consider the simple system for updating a feedforward gain, discussed in 


Example 5.1. Assume that G(s) = 1. The process model is y = ku, the control 
law is u = ĝu., and the desired response is given by ym = koue. The error is 


e = (k8 — ko)u, = k(0 — O° Jue 


where 6° = ky/k. The MIT rule gives the following differential equation for the 
parameter: 


dt = —yk us (@ 6°) 
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This equation has the solution 
6(t) = 6° + (0(0) — 0?) 7" * (5.10) 


where 
t 


i= f u®(c) dt 
0 
and @(0} is the initial value of the parameter @. The estimate converges toward 
its correct value only if the integral J, diverges as £ — co. The convergence is 
exponential if the input signal ie persistently exciting. (Compare with Sec- 
tion 2.4.) The error is given by 


e(t) = ku,(t)(6(0) — 6°)e*** 


Notice that the error will always go to zero as £ > co because either the integral 
I, diverges or else u,(t) > 0. However, the limiting value of the parameter 8 
will depend on the properties of the input signal. m 


Example 5.3 illustrates the fact that the error e goes to zero but that 
the parameters do not necessarily converge to their correct values. This is 
a characteristic feature of all adaptive systems. The input signal must have 
certain properties for the parameters to converge. The conditions required were 
discussed in Chapter 2; compare with the notion of persistent excitation, which 
was introduced in Section 2.4. 


5.3 DETERMINATION OF THE ADAPTATION GAIN 


In Section 5.2 we found that it was straightforward to obtain an adaptive 
system by using the MIT rule. The ađaptive control laws had one parameter, 
the adaptation gain y, which had to be chosen by the user. The simulation 
experiments indicated that the choice of the adaptation gain could be crucial. 
In this section we will discuss methods for determining the adaptation gain. 

Consider the MRAS for adaptation of a feedforward gain in Example 5.1. 
We thus have a system with the transfer function kG(s), where G(s} is known 
and k is an unknown constant. It is assumed that G(s) is stable. We would 
like to find a feedforward control that gives the transfer function koG (s). The 
system is described by the following equations: 


y =kG(p)u 
Ym = koG(p)tte 

u = le 

€ = y — Ym 
d8 


Ti = —Y Yme 
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where u. is the command signal, ym is the model output, y is the process 
output, @ is the adjustable parameter, and p = d/dt is the differential operator. 
Elimination of the variables u and y in these equations gives 


Set Yön (RG(P)OUe) = 9h (6.11) 


This equation is a compact description of the behavior of the parameters that we 
call the parameter equation. Notice that y,, may be considered a known function 
of time. If G(s) is a rational transfer function, Eq. (5.11) is a linear time-varying 
ordinary differential equation. Such equations may exhibit very complicated 
behavior. It is not possible to give a simple analytical characterization of the 
properties of the system, particularly how they are influenced by the parameter 
Y. 


A Thought Experiment 


To get some insight into the behavior of the system given by Eq. (5.11), we 
consider an experiment with the adaptive system such that the equation sim- 
plifies considerably. An understanding of the behavior of the system under such 
circumstances will give us some insight, but it will of course not give the full 
picture. 

Consider the following experiment: Assume that the value of parameter 8 
is fixed, that the adaptation mechanism is disconnected, and that a constant 
input signal u, is applied. The adaptation mechanism is then connected when 
all signals have settled to steady-state values. The behavior of the parameter 
is then given by 


T+ yapu? (KGA) = VOA) (5.12) 


which is a linear time-invariant system. This equation is linear with constant 
coefficients. Its stability is governed by the algebraic equation 


s+ yi uckG(s) = 0 (5.13) 


We can immediately conclude that the behavior of the parameter is determined 
by the quantity 
H = V¥muigh (5.14) 


A picture of how the zeros of Eq. (5.13) vary with parameter y is easily obtained 
by plotting the root locus with respect to the parameter. We can conclude that 
if Eq. (5.18) has zeros in the right half-plane, then the parameters will diverge 
even in the very special conditions of the experiment. Intuitively, we may also 
expect the analysis to approximately describe the case in which the command 
signal is changing slowly with respect. to the dynamics of G(s). 

Equation (5.13) can also be used to determine a suitable value of the 
adaptation gain, as is illustrated in Example 5.4. 
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: EXAMPLE 5.4 Choice of adaptation gain 


Consider the system in Example 5.1 with G(s) = 1/(s + 1), k = 1, and kp = 2. 
Assume that the reference signal has unit amplitude. Equation (5.13) then 
becomes 


set+tst+u=s?+s+yy7uck =0 


A reasonable choice is to make yy?,u°k = 1. If we disregard the transients, the 
average value of Yms is 2. This gives y = 0.5, which is the value used in one 
of the simulations in Fig. 5.3. Oo 


Normalized Algorithms 


It follows from Eq. (5.13) that the adaptive system will be unstable if the 
transfer function G(s) has pole excess larger than 1 and parameter u in 
Eq. (5.14) is sufficiently large. The parameter yp is large if the signals are large 
or if the adaptation gain is large. The behavior of the system depends strongly 
on the signal levels. This will be illustrated by a numerical experiment. 


€ EXAMPLE5.5 Stability depends on the signal amplitudes 
Consider the system in Example 5.1. Let the transfer function G be given by 


1 
5? + 215 + Qo 


G(s) = 
Equation (5.13) then becomes 
s? + ais” + a28 +U = 0 
where u = yy utk. The equation has all its roots in the left half-plane if 
muck < A102 (5.15) 


Since this inequality involves the magnitude of the command signal, it may 
happen that the equilibrium solution corresponding to one command signal is 
stable and the solution corresponding to another command signal is unstable. 
This is illustrated by the simulation results shown in Fig. 5.8, where parame- 
ters are chosen so that k = a, = a2 = 1. In the simulation the adaptation rate 
y was adjusted to give a good response when Ue is a square wave with unit 
amplitude. In this case we have u? = 7%, = 1, and inequality (5.15) gives the 
stability condition y < 1. A reasonable value of y is y = 0.1, which was used 
in the simulation, Figure 5.8 shows clearly that the convergence rate depends 
on the magnitude of the command signal. Notice that the solution is unstable 
when the amplitude of u, is 3.5. The approximate model predicts instability 
for u, larger than 3.16. Also notice that the response is intolerably slow for low 
amplitudes of u,. Oo 
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Figure 5.8 Simulation of the MRAS in Example 5.5. The command signal 
is a square wave with the amplitude (a) 0.1, (b) 1, and (ec) 3.5. The model 
output Ym is a dashed line; the process output is a solid line. The following 
parameters are used: k = a; = a, = @° = 1, and y = 0,1. 


The example indicates clearly that the choice of adaptation gain is crucial 
and that the value chosen depends on the signal levels. Because of this it seems 
natural to modify the algorithm so that it does not depend on the signal levels. 
To do this, we will write the MIT rule as 


dð 
a vee 
where we have introduced p = —0e/06. Introduce the following modified ad- 
justment rule: 
dð _ ype 
dt” a+gto 
where parameter œ > 0 is introduced to avoid difficulties when @ is small. 
Notice that we have written the equation in such a way that it also holds when 
8 is a vector; in that case, g is also a vector of the same dimension. 
If we repeat the analysis of the thought experiment, we find that Eg. (5.13) 
is replaced by 


(5.16) 


pus 
——_— kG{s) = 0 
Since g° is proportional to u$, the roots of this equation will not change much 
with the signal levels. The adaptation rule given by Eq. (5.16) is called the nor- 
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Figure 5.9 Simulation of the MRAS in Example 5.5 with the normalized 
MIT rule. The command signal is a square wave with the amplitude (a) 0.1, 
(b) 1, and (c) 3.5. Compare with Fig. 5.8. The model output ym is a dashed 
line; the process output is a solid line. The parameters used are k = a; = 
ad, = 6° = 1, œ = 0.001, and y = 0.1. 


malized MIT rule. The improved performance with this algorithm is illustrated 
in Fig. 5.9. A comparison with Fig. 5.8 shows that normalization is useful. 

Notice that the normalized adjustment rule performs very well even in 
the cases in which difficulties were encountered with the MIT rule. It is in 
fact possible to make the modified adjustment rule work very well over a wide 
range of command signal amplitudes. Notice that the normalization is obtained 
automatically with algorithms based on parameter estimation. (Compare with 
Example 2.16.) 


Summary 


Having derived the MIT rule and investigated some of its properties, we can 
now summarize some of the key issues. The model-reference control problem 
can be described as follows: Let the desired performance be specified by a 
reference model having the transfer function G(s), and let the closed-loop 
transfer function of the plant be G(s, 8}, where @ are the adjustable parameters. 
Furthermore, let u, be the command signal, The model-reference adaptive 
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system tries to change the controller parameters so that the error 


e(t) = (G(p, 8) — Gm(p) juc (8) 
goes to zero. The MIT rule given by 


28 ppe 


where g = —de/06 and y is the adaptation gain, can be interpreted as a 
gradient method for minimizing the error. The MIT rule can be applied in 
many different cases; a few examples have been given in this section. The 
choice of the adaptation gain is critical and depends on the signal levels. The 
normalized algorithm 

de pe 

dt u+ gp" 9 
is less sensitive to signal levels. Notice that a normalization of a similar type is 
obtained automatically in the self-tuning regulator. Compare with Eq. (3.22). 

Preliminary numerical experiments indicate that the systems obtained 

with the MIT rule work as expected for small adaptation gains. Very complex 
behavior may be obtained for high adaptation gains. To proceed to develop our 
understanding of adaptive systems, we will investigate the stability problem. 


5.4 LYAPUNOV THEORY 


There is no guarantee that an adaptive controller based on the MIT rule will 
give a stable closed-loop system. It is clearly desirable to see whether there 
are other methods for designing adaptive controllers that can guarantee the 
stability of the system. As a first step in this direction we now present the 
Lyapunov stability theory. For the benefit of students who are encountering 
Lyapunov theory for the first time, we first prove a stability theory for time- 
invariant systems. We then state a more powerful theorem for time-varying 
systems, which can be used to design adaptive controllers. 


Lyapunov’s Theory for Time-invariant Systems 


Fundamental contributions to the stability theory for nonlinear systems were 
made by the Russian mathematician Lyapunov in the end of the nineteenth 
century. Lyapunov investigated the nonlinear differential equation 


dx 


a = Fle) fO =0 (5.17) 


Since f(0) = 0, the equation has the solution x(t) = 0. To guarantee that a 
solution exists and is unique, it is necessary to make some assumptions about 
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f(x). A sufficient assumption is that f(x) is locally Lipschitz, that is, 


F(x) -—fFO)| <Lle-y] L>0 


in the neighborhood of the origin. Lyapunov was interested in investigating 
whether the solution of Eq. (5.17) is stable with respect to perturbations. For 
this purpose he introduced the following stability concept. 


DEFINITION 5.1 Lyapunov stability 
The solution x(é) = 0 to the differential equation (5.17) is called stable if for 
given £ > 0 there exists a number d(€) > 0 such that all solutions with initial 
conditions 

læ(0)] < 8 


have the property 
|Ix(é)|| <£ for O<t<co (5.18) 


The solution is unstable if it is not stable. The solution is asymptotically stable 
if it is stable and 5 can be found such that all solutions with ||x(0)j{] < 5 have 
the property that ||x(¢)|| + 0 as t > œ. o 


Remark 1. Ifthe solution is asymptotically stable for any initial value, then 
it is said to be globally asymptotically stable. 

Remark 2. Notice that Lyapunov stability refers to stability of a particular 
solution and not to the differential equation. m 


Lyapunov developed a method for investigating stability that is based on 
the idea of finding a function with special properties. To describe these, we 
first introduce the notion of positive definite functions. 


DEFINITION 5.2 Positive definite and semidefinite functions 
A continuously differentiable function V : R” — R is called positive definite in 
a region U C R” containing the origin if 
1. V(0)=0 
2. V(x)>0, xe U andx#0 
A function is called positive semidefinite if Condition 2 is replaced by V(x} > 0. 
o 


A positive definite function has level curves that enclose the origin. Curves 
corresponding to larger values of the function enclose curves that correspond 
to smaller values. The situation in the two-dimensional case is illustrated in 
Fig. 5.10. If we can find a function so that the velocity vector, dx/dt = f(x), 
always points toward the interior of the level curves, then it seems intuitively 
clear that a solution that starts inside a given level curve can never pass to 
the outside of the same level curve. We have the following theorem. 
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Figure 5.10 M[lustration of Lyapunov’s method for investigating stability. 


THEOREM 5.1 Lyapunov’s stability theorem: time-invariant systems 


If there exists a function V : R" — R that is positive definite such that its 
derivative along the solution of Eq. (5.17), 


dV OV? dx av? 
Ho Or u” op fH) = -WO) (5.19) 


is negative semidefinite, then the solution x(t} = 0 to Eq. (5.17) is stable. If 
dV jdt is negative definite, then the solution is also asymptotically stable. The 
function V is called a Lyapunov function for the system (5.17). 

Moreover if 


x <0 and V(x}—co when |[x|| + 00 


then the solution is globally asymptotically stable. 
Proof: Given £ > 0 such that {x| |x|] < £} E U, determine £ and 65 such that 


£= min V(x max V(x 5.20 
zl|=e (x) = lls (=) ( ) 


Consider initial conditions such that 
lz(0)|| < ô 

Since V is positive definite, it then follows from Definition 5.2 that 
V{x(0)) < £ 


To prove that inequality (5.18} holds, we proceed by contradiction. Assume that 
tı is the smallest value such that |jx(ż41}|| = £. It follows from Eq. (5.20) that 


V (x(t) > £ 
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Furthermore, 
È 1 t 1 


V (x(t1)) = V (x(0)) + | TY at = V (x(0)) — f W{(x(s)}ds (5.21) 


0 
Since W(x) is positive semidefinite, it follows that 
V (x(t1)) < V (0) < £ 


and we have thus obtained a contradiction and it can be concluded that ||x(é)|| < 
g€ for all ¢, which by Definition 5.1 implies that the solution x(t) = 0 is stable. 
To prove asymptotic stability, we notice that it follows from Eq. (5.21) that 


os f W(x(s)) d =V (x(0))—V (x) sé 
0 


Since W(x) and x(t) are continuous, it then follows that 
lim W (x(t))=0 
If W(x} is positive definite, this implies that x(f) — 0 as t > co. o 


Remark. Notice that it follows from the proof that if the derivative of the 
Lyapunov function is negative semidefinite, the solution converges to the set 
{x| W(x) = 0}. E 


Finding Lyapunov Functions 


Lyapunov’s theorem is very elegant. However, it is necessary to have meth- 
ods for constructing Lyapunov functions. There is no universal method for 
constructing Lyapunov functions for a stable system. To apply the method, 
we therefore have to resort to trial and error. A good first attempt is to test 
quadratic functions. However, for linear systems we have the following impor- 
tant result. 


THEOREM 5.2 Lyapunov functions for linear systems 
Assume that the linear system 


= Ax (5.22) 


is asymptotically stable. Then for each symmetric positive definite matrix Q 
there exists a unique symmetric positive definite matrix P such that 


ATP + PA = -Q (5.23) 
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Furthermore, the function 

V(x) =x" Px (5.24) 
is a Lyapunov function for Eq. (5.22). 
Proof: Let @ be a symmetric positive definite matrix. Define 


P(t) = I e AQA- ds 
0 


The matrix P is symmetric and positive definite because an integral of positive 
definite matrices is positive definite. The matrix P also satisfies 
= =A?P+PA+Q 


Since the matrix A is stable, the limit 

P, = lim P(Z) 
exists. This matrix satisfies Eq. (5.23). It can also be shown that the solution 
to Eq. (5.23) is unique, which completes the argument. oO 


For a stable linear system we can thus always find a quadratic Lyapunov 
function. To use Theorem 5.2 to construct a Lyapunov function, we simply 
choose a positive matrix Q and solve the linear equation (5.23) for P. The 
following example shows how it can be done. 


EXAMPLE5.6 Lyapunov functions for a linear system 
Consider the linear system (5.22) with 


a= [2 2) 
ag &4 
where it is assumed that all eigenvalues of A are in the left half-plane. Let the 
matrix @ be 
qa 0 
ate” 
0 g 
where qı and ge are positive. Assume that the matrix P has the form 


pa & | 
Po pa 


Equation (5.23) then becomes 


2a) 2a3 0 PL ~q 
Qo a,+@4 ay P | = 0 
0 2a, 204 Pa — G2 


This is a linear equation. Theorem 5.2 implies that it always has a solution 
when A is stable and that the solution is a positive definite matrix P. Oo 
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Lyapunov Theory for Tume-variable Systems 
We now consider time-variable differential equations of the type 


dx 
= = F(t) (5.25) 


The origin is an equilibrium point for Eq. (5.25) if f(0,£) = 0 Vt > 0. It is 
assumed that f is such that solutions exist for all t > to. To guarantee this, 
it is assumed that f is piecewise continuous in ¢ and locally Lipschitz in x in 
a neighborhood of x(t} = 0. We now investigate the stability of the solution 
x(t) = 0. 

In the time-varying case the solution will depend on ¢ as well as on the 
starting time to. This implies that the bound 6 in Definition 5.1 will depend 
on £ and to. The definition on stability can be refined to give uniform stability 
properties with respect to the initial time. We have the following definition. 


DEFINITION 5.3 Uniform Lyapunov stability 


The solution x(t) = 0 of Eq. (5.25) is uniformly stable if for £ > 0 there exists 
a number (£) > 0, independent of tọ, such that 


Ix(tol< 8 = l<e Virto>0 


The solution is uniformly asymptotically stable if it is uniformly stable and 
there is c > 0, independent of tp, such that x(t) — 0 as £ > œ, uniformly in fo, 
for all ||x(to)|| < c. m 


To state a stability theorem for solutions to Eq. (5.25), we first have to 
introduce the so-called class K functions. 


DEFINITION 54 Class K function 


A continuous function @; [0,a} — [0,00) is said to belong to class K if it is 
strictly increasing and o{0) = 0. It is said to belong to class K if a = oo and 
a{r) — o as r > o. m 


For time-varying systems the following stability theorem can now be stated. 


THEOREM 5.3 Lyapunov’s stability theorem: Time-varying systems 
Let x = 0 be an equilibrium point for Eq. (5.25) and D = {x € R”||\x|| < r}. 
Let V be a continuously differentiable function such that 

a1 (Ilx||) < V(x, t} < œ2(llxll) (5.26) 


dV av av 
aT ~mtR f(x, t) < —as(I|x|}) 


5.4 Lyapunov Theory 205 


for Vt > 0, where 1, @o, and œz are class K functions. Then x = 0 is uniformly 
asymptotically stable. 
Proof: A proof can be found in Khalil (1992). oO 


Remark 1. The derivative of V along the trajectories of Eq. (5.25) is now given 


by dV OV av 
dt ~ Gt + Ge fH 


Remark 2. A function V (x,t) satisfying the left inequality of (5.26) is said to 
be positive definite. A function satisfying the right inequality of (5.26) is said 
to be decrescent. 


Remark 3. To show stability for time-variable systems, it is necessary to 
bound the function V (x,t) by a function that doesn’t depend on £. Oo 


When using Lyapunov theory on adaptive contro! problems, we often find 
that dV /dt only is negative semidefinite. This implies that additional con- 
ditions must be imposed on the system. The following lemma gives a useful 
result. 


LEMMA 531 Barbalat’s lemma 


If g is a real function of a real variable ¢, defined and uniformly continuous for 
t > 0, and if the limit of the integral 


t 
IESE 
0 

as t tends to infinity exists and is a finite number, then 


lim g(t) = 0 o 


Remark. A consequence of Barbalat’s lemma is that if g € Le and dg/dt is 
bounded, then 

lim g(t) = 0 m 

When applying Lyapunov theory to an adaptive control problem, we get 

a time derivative of the Lyapunov function V, which depends on the control 

signal and other signals in the system. If these signals are bounded, Lemma 5.1 


and the remark that follows can be used on dV /dt to prove stability. We have 
the following theorem. 


THEOREM 54 Boundedness and convergence set 


Let D = {x € R*|||x|| < r} and suppose that f(x,t) is locally Lipschitz on 
D x [0, co). Let V be a continuously differentiable function such that 


a (Ilxll) < V(x, t) < oa((lxll) 
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and 
dv av. av 


a He 
Vt > 0, Vx € D, where a and a, are class K functions defined on [0,7r) 
and W(x) is continuous on D. Further, it is assumed that dV /dt is uniformly 
continuous in f. 
Then all solutions to Eq. (5.25) with ||x(to)|| < az*(@i(r)) are bounded and 
satisfy 


F(x,t) S —W(x) <0 | 


W(x(t})} +0 as t—-co 


Moreover, if all the assumptions hold globally and a, belongs to class Kos, the 
statement is true for all x(tọ) € F”. oO 


A proof of a slight modification of this theorem can be found in Khalil 
(1992). The theorem states that the states of the system are bounded and that 
they approach the set {x € D] W(x) = 0}. In the theorem it is assumed that 
dV /dt is uniformly continuous, that is, that the continuity is independent of 
t. A sufficient condition for this is that V is bounded. 


5.5 DESIGN OF MRAS USING LYAPUNOV THEORY 


We will now show how Lyapunov’ stability theory can be used to construct 
algorithms for adjusting parameters in adaptive systems. To do this, we first 
derive a differential equation for the error, e = y — Ym. This differential equa- 
tion contains the adjustable parameters. We then attempt to find a Lyapunov 
function and an adaptation mechanism such that the error will go to zero. 
When using the Lyaponov theory for adaptive systems, we find that dV /dt is 
usually only negative semidefinite. The procedure is to determine the error 
equation and a Lyapunov function with a bounded second derivative. Theorem 
5.4 is then used to show boundedness and that the error goes to zero. To show 
parameter convergence, it is necessary to impose further conditions, such as 
persistently excitation and uniform observability, on the reference signal and 
the system. (See the references in the end of the chapter.) We start with a 
simple example. 


EXAMPLE 5.7 First-order MRAS based on stability theory 
Consider the problem in Example 5.2. The desired response is given by 


Lm 
at 
where a,, > 0 and the reference signal is bounded. The process is described by 


= —Am Vm + bmlte 


oY = —ay + bu 
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The controller is 
u = Bu, — Gry 
Introduce the error 
E= Y— Ym 
Since we are trying to make the error small, it is natural to derive a differential 
equation for the error. We get 


de 
at 


Notice that the error goes to zero if the parameters are equal to the values 
given by Eqs. (5.8). We will now attempt to construct a parameter adjustment 
mechanism that will drive the parameters 6, and 8z to their desired values. 
For this purpose, assume that by > 0 and introduce the following quadratic 
function: 


= —G,,¢ — (b82 + a —am)y + (b01 — bm) Ue 


1 1 1 
Y {e, 61, 82) = 3 G + by (bô: +a am” + by (bê: — bn) 


This function is zero when e is zero and the controller parameters are equal 
to the correct values. For the function to qualify as a Lyapunov function the 
derivative dV /dt must be negative. The derivative is 


dV oi 


au °au + 7 (002 +a- am) te T (be, — 


1 
bm) ae 
= ape + — 1 (66, +@—Gn) ao, — yye 

Y dt 
1 dé, 
+ 7 (8, — bm) (T + yuce) 


If the parameters are updated as 


dei ace 
dt (5.27) 
dêz _ 
dt = yye 
we get 
dV > a e 
dt” 


The derivative of V with respect to time is thus negative semidefinite but 
not negative definite. This implies that V(t) < V(0) and thus that e, 81, and 
82 must be bounded. This implies that y = e+ Ym also is bounded. To use 
Theorem 5.4, we determine 


av de 


Te = —2a mes = —2a me (ame — (bba + a—an)y + (b81 — bm) ue) 
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Figure 5.11 Block diagram of an MRAS based on Lyapunov theory for a 
first-order system. Compare with the controller based on the MIT rule for the 
same system in Fig. 5.4. 


Since u., e, and y are bounded, it follows that V is bounded; hence dV fdt is 
uniformly continuous. From Theorem 5.4 it now follows that the error e will go 
to zero. However, the parameters will not necessarily converge to their correct 
values; it is shown only that they are bounded. To have parameter convergence, 
it is necessary to impose conditions on the excitation of the system. (Compare 
with Example 5.3.) 

The adaptation rule given by Eqs. (5.27) is similar to the MIT rule given by 
Eqs. (5.9), but the sensitivity derivatives are replaced by other signals. A block 
diagram of the system is shown in Fig. 5.11. Compare with the corresponding 
block diagram for the system with the MIT rule in Fig. 5.4. The only difference 
is that there is no filtering of the signals u. and y with the Lyapunov rule. In 
both cases the adjustment law can be written as 


dé 
ae =P? (5.28) 
where @ is a vector of parameters and 
g=(-u y) 
for the Lyapunov rule and 
_ Um _ T 
p = P+an, ( Ue y) 


for the MIT rule. The adjustment rule obtained from Lyapunov theory is 
simpler because it does not require filtering of the signals. Figure 5.12 shows a 
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Figure 5.12 Simulation of the system in Example 5.7 using an adaptive 
controller based on Lyapunov theory. The parameter values are a = 1, b = 0.5, 


Gn = Öm = 2, and y = 1. (a) Process (solid line) and model (dashed line) 
outputs. (b) Control signal. 
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Figure 5.13 Controller parameters @, and @2 for the system in Example 5.7 
when y = 0.2, 1, and 5. The dotted lines are the parameters obtained with 
the MIT rule. Compare Fig. 5.6. 


simulation of the system for the case G(s) = 0.5/(s +1) and G,,(s)} = 2/(s4+ 2). 
The behavior is quite similar to that obtained with the MIT rule in Fig. 5.5. 
Notice, however, that arbitrary large values of the adaptation gain y can be 
used. with the Lyapunov approach. 

Figure 5.13 shows the parameter estimates in the simulation for different 
values of adaptation gain y. For comparison we have also shown the parameters 
obtained with the MIT rule. o 


210 Chapter 5 Model-Reference Adaptive Systems 


State Space Systems 


We will now show how Lyapunov’s theory can be used to derive stable MRASs 
for general linear systems. The idea is the same as used previously. It can be 
described as follows: 

1. Find a controller structure. 

2. Derive the error equation. 

3. Find a Lyapunov function and use it to derive a parameter updating law 

such that the error will go to zero. 

Consider a linear system described by 


dx 

—=Ax+B . 

it + Bu (5.29) 
Assume that it is desired to find a control law so that the response to command 


signals is given by 


dtm — 
dE. = Amim + Bryttc (5.30) 
A general linear control law for the system given by Eq. (5.29) is 
u = Mu,— Lx (5.31) 
The closed-loop system then becomes 
Z = {A— BL)x + BM u. = A,(0)x + Be(8)te (5.32) 


The control law can be parameterized in different ways. All parameters in the 
matrices L and M may be chosen freely. There may also be constraints among 
the parameters. The general case can be captured by assuming that the closed- 
loop system is described by Eq. (5.82), where matrices A, and B, depend on a 
parameter @. 


Compatibility conditions It is not always possible to find parameters @ such 
that Eq. (5.32) is equivalent to Eq. (5.30). A sufficient condition is that there 
exists a parameter value @° such that 


A:(6") = Am (5.33) 
B,(0°) = Bm 


This condition for perfect model-following is fairly stringent. When all param- 
eters in the control law can be chosen freely, it implies that 
A—-A, = BL 
Bn = BM 


This means that the columns of matrices A — Am and Bm are linear combi- 
nations of the columns of matrix B. If these conditions are satisfied and the 


6.5 Design of MRAS Using Lyapunov Theory 211 


columns of B and B,, are linearly independent, then the matrices L and M 
are given by 


L = (B" B)' Bt (A—A,,) = (BI BY! BĪ (A — An) 
M = (B7 BY' BT Bm = (BË BY 1B" Bm 
The error equation Introduce the error defined as 


E= X — Xm 
Subtracting Eq. (5.30) from Eq. (5.29) gives 
de dx dëm _ 
u u ae = Ax + Bu — Å mXm — Bme 
Adding and subtracting Amx from the right-hand side give 
= = Ane + (A— An — BL) x + (BM — Bp) tc 
= Ame + (Ac(0) — Ac(Oo)) x + (Be(8) — Be(@0)) ue 
= Ame + Y (6 — 6°) (5.34) 


To obtain the last equality, it has been assumed that the conditions for exact 
model-following are satisfied. This is required for @° to exist. To derive a 
parameter adjustment law, we introduce the Lyapunov function 


V(e,6) = 5 (ye? Pe + (6 — 0°)? (0 — 6°) 


where P is a positive definite matrix. The function V is positive definite. To 
find out whether it can be a Lyapunov function, we calculate its total time 
derivative dv de 
av _ Vor _ poy? _ poyr ae 
dae Qe+7(@— O°)? Pe + (8 — 0°) E 
_ Vor _ pT dé T 
=—5e Qe + (@ — 0°) (S +y¥¥ Pe) 
where @ is positive definite and such that 
AT P+ PA, = —Q 
Notice that it follows from Theorem 5.2 that a pair of positive definite matrices 
F and Q with this property always exist if Am is stable. 
If the parameter adjustment law is chosen to be 


-a OO T ' 
Tk yF Pe (5.35) 
we get av 
__’ or 
de ~~ 9° @ 


The time derivative of the Lyapunov function is negative semidefinite, By using 
Lemma 5.1 in the same way as in Example 5.7 it can be shown that the error 
goes to zero. Notice that we have assumed that all states x are measurable. 
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Adaptation of a Feedforward Gain 


We now attempt to use Lyapunov theory to derive parameter adjustment laws 
for the problem of adjusting a feedforward gain. We consider the case in which 
the plant has transfer function kG(s), where G(s) is known and & is unknown. 
The desired response is given by the transfer function koG (s). This problem 
was discussed previously in Examples 5.1 and 5.3. The error is given by 


e = (kG(p)@ — koG(p) ue = kG(p)(0 — O° )uc 


where 6° = ko/k. To use Lyapunov theory, we first introduce a state space 
representation of the transfer function G. The relation between the parameter 
Ə and the error e can then be written as 


dx 

— = Ax + B(8 — Oyu, 

di + B(O— Ou (5.36) 
e = Cr 


If the homogeneous system * = Ax is asymptotically stable, there exist positive 
definite matrices P and Q such that 


A’P+PA=-@ (5.37) 
Choose the following function as a candidate for a Lyapunov function: 


V= 5 (yx? Px + (@—6°)?) 


The time derivative of V along the differential equation (Eqs. 5.36) is given by 


Using Eqs. (5.36), we get 
T= 1 (Ax + Buc(@—0°))" Px +27 P (Ax + Buel — 0°))) 
+ (6-6) d 
= -Ż xT Qx + (8—8) (F + Yle BT Pa) 


If the parameter adjustment law is chosen to be 


d8 

dt 
we find that the derivative of the Lyapunov function will be negative as long 
as x # 0. The state vector x and the error e = Cx will go to zero as ¢ goes to 
infinity. Notice, however, that the parameter error 6 — 6° will not necessarily 
go to zero. 


= ~7u, BT Px (5.38) 
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Output feedback The result obtained is quite restrictive because it requires 
that all state variables are known. A parameter adjustment law that uses 
output feedback can be obtained if the Lyapunov function can be chosen so 
that 

B’P=C 
where C is the output matrix of the system in Eq. (5.34). With this choice of 
P it follows that 


B? Px = Cx =e 
and the adjustment rule becomes 
dé 
Ge Te 


The appropriate condition is given by the celebrated Kalman-Yakubovich lem- 
ma. The following definition is needed to state this lemma. 


DEFINITION 5.5 Positive real transfer function 
A rational transfer function G with real coefficients is positive real (PR) if 


ReG(s)>0 for Res>0 (5.39) 
A transfer function G is strictly positive real (SPR) if G(s — £) is positive real 
for some real € > 0. Oo 


The concept of SPR is discussed further in Section 5.6. Let it suffice to 
mention that G(s) = 1/(s + 1) is SPR and G(s) = 1/s is PR but not SPR. The 
following result gives a state space interpretation of SPR. 


LEMMA 5.2 Kalman-Yakubovich lemma 
Let the time-invariant linear system 


dx 
—=-Ax+B 
dt + Bu 
y= Cx 


be completely controllable and completely observable. The transfer function 
G(s) = C(s1 — A) 1B 


is strictly positive real if and only if there exist positive definite matrices P 
and Q such that 
ATP + PA = —Q 
and 
BTP=C oO 


A proof of this result is given in Section 5.6. There is a more general version 
of the theorem that applies to systems with a direct term from input to output. 
The simpler version is sufficient for our purposes. 
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(a) Model 


Figure 6.14 Block diagrams of the adaptive systems for feedforward gain 
compensation obtained by (a) the MIT rule and (b) the Lyapunov rule. 


THEOREM 5.5 MRAS using the Lyapunov rule 


Consider the problem of adapting a feedforward gain. Assume that the transfer 
function G is strictly positive real. Then the parameter adjustment rule 


dé 
where y is a positive constant, makes the output error e in Eqs. (5.36) go to 
zero. o 


The control law of Eq. (5.40) is very similar to the control law obtained 
by the MIT rule, Eq. (5.5). This is illustrated in Fig. 5.14, which shows block 
diagrams of both systems. The only difference between the systems is that 
the connection to the first multiplier comes from the model output for the MIT 
rule and from the command signal for the Lyapunov rule. This seemingly small 
difference has major consequences, however. 


A remark on the assumptions It may seem strange that such drastically dif- 
ferent behaviors can be obtained by minor modifications of the system. It also 
seems strange that it is possible to use arbitrarily high adaptation gains. This is 
because the assumption that a transfer function is positive real is very strong. 
It follows from Definition 5.5 that Re G{iw) > 0 if the transfer function G(s) 
is positive real. This means that the Nyquist curve of G is in the right half- 
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plane. Such a system is stable under proportional feedback with arbitrarily 
high gain. The closed-loop system can be made arbitrarily insensitive to the 
gain variations. The result is of limited practical value because of the strong 
assumptions that are made. 


Summary 


In this section we have shown that it is possible to construct parameter ad- 
justment rules based on Lyapunov’s stability theory. The adjustment rules 
obtained in this way guarantee that the error goes to zero, but it cannot be 
asserted that the parameters converge to their correct values. The adjustment 
rules obtained are similar to those obtained by the MIT rule. However, the 
rules are not normalized. The adjustment rules have the remarkable property 
that arbitrarily high adaptation gains can be used. This property depends on 
the strong assumptions that are made. This is discussed further in Chapter 6. 


5.6 BOUNDED-INPUT, BOUNDED-OUTPUT STABILITY 


Systems can be described from two points of view: the internal or state space 
view or the external or input-output view. The state space approach is based on 
a detailed description of the inner structure of the system. In the input-output 
approach, a system is considered to be a black box that transforms inputs to 
outputs. In Section 5.5 we approached stability from the state space view. In 
this section we develop stability theory from the input-output view. In the next 
section the results are applied to design of adaptive controllers. 

We start with a brief presentation of the operator view of dynamical sys- 
tems. This leads naturally to the concept of bounded-input, bounded-output 
(BIBO) stability. The fundamental results like the small gain theorem and the 
passivity theorem are then presented. In Section 5.5 we found that the notion 
of positive real was essential. This notion, which is closely related to passivity, 
will also be discussed. 


The Operator View of Dynamical Systems 


Signals are elements of a normed space X, which we call the signal space. A 
system S$ is considered as an operator S : X — X. For simplicity we consider 
systems with one input and one output and the signals are real functions from 
R to R. Several choices of norms are considered, for example, the Lo norm 


l= (f wo a): 
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or the sup norm 


lel = sup |u(é)| 
0<t<00 


A drawback of using Lə is that it must be assumed a priori that all signals 
go to zero as £ — co. The notion of extended space is introduced to avoid this 
assumption. This is introduced as follows. l 

Let Y be the space of real-valued functions on [0, œo}. Let x be an element 
of Y. The truncation of x at T > 0 is defined as 


os _ fat) O< t<T 
n= {3 >T 


DEFINITION 5.6 Extended space 


If X is a normed linear subspace of Y, then the extended space X, is the set 
{x€ Y| xr € X for some fixed T > 0}. o 


The extended Lə space is denoted Le. There is now a simple way to 
introduce the notion of the gain of a system. 


DEFINITION 57 Gain of a nonlinear system 
Let the signal space be X.. The gain 7(S) of a system S is defined as 


[|Sze]] 
S) = sup 
= ey la 


where u is the input signal to the system. 
Remark. The gain is thus the smallest value such that 


Sult < 7(S)lul] for ali u € Xe 


We use supremum because the maximum of ||Su||/||u|| may not exist for signals 
in the class that we are considering. | 


We illustrate the definition with a few examples. 


| EXAMPLES5.8 Linear systems with signals in Lo, 


Let the signal space be Le. Consider a linear system with the transfer function 
G(s). Assume that G(s) has no poles in the closed right half-plane and that 
the system is initially at rest. Let u be the input and y the output, and let 
U and Y be the corresponding Laplace transforms. It follows from Parseval’s 
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theorem, Theorem 2.8, that 


b= f XOa= = f Y(@)¥(-iw) do 


== T. Glio)U (io)G(-io)U (—io) de 


alo l 
< max|G(ia)| on L. U(i@)U (-iw) da 
=max|@(io)P f w(t) dt = max |G(io)P - |? 
= 0 


Hence 
iyi < max |G (ie)| - [ell 


The gain is thus less than max|G(i@)|. We get equality in the above equation 
if u is a sinusoid with the frequency that maximizes |G(iw)|. However, such 
a signal is not in Læ. The value of ||y|| can be made arbitrarily close to 
max |G(iw)| with a truncated sinusoid in Lz, by making T sufficiently large. 
The gain of the system is thus 


7(G) = max|G(io)| (5.41) 
oO 


EXAMPLE 5.9 Linear system with sup norm 
Consider a stable linear system with impulse response A(t). We have 


y(t) = Í Mt- 


Using the sup norm, we get 


bOl=|[ houe- dr 


< suphu) | AC) ae 
This gives 
sup |x(6)| < 7 (G) - sup lu(e)| 


where the gain of the system is given by 


(6) = | hE de 
If we let uo = max; |u(é)|, the maximum is assumed for the signal 


u(s) = uo sign (h(t —s)) 


However, this signal is not in Lee. Since the system is stable, we can get 
arbitrarily close with a signal in Lz, by making T sufficiently large. m) 
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Figure 5.15 Illustration of the gain of a static nonlinearity. 


EXAMPLE5.10 Static nonlinear system 
Consider a static system that is described by the nonlinear equation 


y(t) = Fu) 
For all norms we have 
lx(t)| < max | f(u(2))| 
The gain of the system is thus given by 
m IF) 
rone 


The gain of a static system has a simple interpretation. A function whose norm 
is y can be bounded between the straight lines y = +yu, as is illustrated in 
Fig. 5.15. o 


Having defined the gain of a system, we can now define stability. 


DEFINITION 5.8 BIBO stability 


A system is called bounded-input, bounded-output (BIBO) stable if the system 
has bounded gain. O 


Notice that this definition refers to stability of a system and not stability 
of a particular solution. Also notice that a system with bounded gain is BIBO 
stable but that the converse is not true. The static system y = u? does not have 
finite gain, but it is BIBO stable. 


Stability Criteria 
Having defined the notion of stability, we now give criteria for stability. For this 
purpose, consider the simple feedback system in Fig. 5.16. We are interested 


in determining when the gain from u to y is bounded. We have the following 
theorem. 
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THEOREM 5.6 The small gain theorem 


Consider the system in Fig. 5.16. Let 7, and 72 be the gains of the systems Hy 
and Hə. The closed-loop system is BIBO stable if 


Y1¥2 <1 (5.42) 
and its gain is less than 
Yı 
= m 5.43 
4 1 = 7172 ( 


Outline of proof: For a rigorous proof it must first be established that y exists. 
If this is true, we have 


y= Me = H,(u — Hoy) 
Hence 
Hol] < [Erel + [21 Hes] < yill + 717eIMlol 


Because of Eq. (5.42) we can solve for ||y||. Hence 
yı 
< ———— |u| = u 
Holl < ball = vl 


which proves BIBO stability and gives the expression (5.43) for the gain of the 
system. o 


Remark 1. The result has a strong intuitive interpretation. It simply says that 
if the total gain around the loop is less than 1, then the closed-loop system is 
stable. 

Remark 2. For the special case of linear systems with Le norms it follows from 
Example 5.8 that the gain is the maximum magnitude of the transfer function. 
The theorem can be interpreted as an extension of the Nyquist theorem. The 
condition (5.42) implies that the loop gain is always less than 1. From this 
interpretation we can also conclude that the result is quite conservative. 


Passivity 


We now present another stability theorem that is also based on the input- 
output point of view. The starting point is the notion of passivity, which is 
an abstract formulation of the idea of energy dissipation. Passive systems are 


Figure 5.16 Block diagram of a simple feedback loop. 
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common in engineering. A system composed only of components like resistors, 
capacitors, and inductors is one example from electrical engineering. A system 
composed of masses, Springs, and dashpots is an example from mechanical 
engineering. When dealing with electrical systems, we will consider two-port 
systems in which the current is the input and the voltage is the output. The 
same concepts apply to mechanical systems, in which the variables are position 
and force. 

Passivity is naturally associated with power dissipation, Such a concept 
can be defined for linear as well as nonlinear systems. Roughly speaking, the 
passivity theorem says that a feedback connection of one passive system and 
one strictly passive system is stable. To state the result formally, we need an 
abstract notion of passivity. We start with the operator view of systems, in 
which a system is represented by an operator mapping signals to signals. The 
signal space is assumed to be Lz, with a scalar product defined by 


oo T 
(x|y} = | x(s)y(s) ds= | x(s)>(s) ds 
— 
We have the following definition. 


DEFINITION 5.9 Passive system 
A system with input u and output y is passive if 
(lu) 2 0 
The system is input strictly passive (ISP) if there exists € > 0 such that 
Olu) = ejja” 
and output strictly passive (OSP) if there exists £ > 0 such that 


(y|u) z ell? 


oD 


Notice that in electrical systems the power is proportional to the product of 
current and voltage. The definition is thus a very natural abstraction. The 
following example illustrates the definition of passivity. 


_ EXAMPLE 5.11 Static nonlinear systems 


Consider a static nonlinear system characterized by the function f : R —> R. 
We have 


olu = | Feul) at 
The right-hand side is thus nonnegative if 
xf(x) 20 (5.44) 
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which is the condition for passivity. This condition means that the graph of the 
curve f is entirely in the first and the third quadrants. The system is input 
strictly passive if 


x f(x) 2 d|x)? 
It is output strictly passive if 
x f (x) > 6 f(x) 
A static system with f(x) = x + x? is thus input strictly passive, and a static 
system with f(x) = x/(1+ |x|) is output strictly passive. B) 


Positive Real Functions 


For linear systems the concept of passivity is closely related to the proper- 
ties positive real and strictly positive real introduced in Definition 5.5 in Sec- 
tion 5.5. The notion of positive real did actually originate from an effort to 
characterize driving point impedance functions for linear circuits composed of 
passive components. The driving point impedance function is the transfer func- 
tion from current to voltage across two terminals in a circuit. The driving point 
admittance function is the transfer function from voltage to current. In circuit 
theory it was established that such impedance functions have certain proper- 
ties that were taken as the definition of positive real. In this section we discuss 
some properties of positive real functions. It follows from Definition 5.5 that 
if the transfer function G(s) is PR (SPR), then its inverse 1/G(s) is also PR 
(SPR). This is a direct consequence of the symmetry of admittance functions 
and impedance functions. It does not matter whether we consider current or 
voltage as the input to a circuit. Positive real functions can be characterized 
in many different ways. An alternative to Definition 5.5 that is easier to use 
is given by the following theorem. 


THEOREM 5.7 Conditions for positive realness 


A rational transfer function G(s) with real coefficients is PR if and only if the 
following conditions hold. 


(i) The function has no poles in the right half-plane. 


(ii) If the function has poles on the imaginary axis or at infinity, they are 
simple poles with positive residues. 


(iii) The real part of G is nonnegative along the iw axis, that is, 
Re (G(ia)) > 0 (5.45) 


A transfer function is SPR if conditions (i) and (iii) hold and if condition {ii) 
is replaced by the condition that G(s) has no poles or zeros on the imaginary 


axis. 
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Proof: Assume that G(s) is PR. Since it is rational, the only singularities are 
poles. A function assumes all values around a pole. According to Definition 5.5 
the function has positive real part for Res > 0. Hence it cannot have poles 
in this region. Equation (5.45) follows by setting s = iœ in Definition 5.5. 
Furthermore, G(s) cannot have multiple poles at infinity because the condition 
Re G(s) > 0 for Res > 0 would then be violated. For the same reason a pole at 
infinity must also have positive residue. l 

We have thus shown the necessity. To show sufficiency, we use the fact 
that a function that is analytic in a region assumes its largest values on the 
boundary. Consider the function 


F(s) = e@) 


We have 
|F(s)| = Bee (5.46) 


Let the region D be bounded by the imaginary axis and an infinite half-circle 
to the right with the imaginary axis as a diameter. Let Il be the boundary of 
D. Assume that conditions (i), (ii), and (iii) hold. Because of condition (iti) 
we have |F(s)| > 1 on the imaginary axis. It now remains to investigate the 
value of F on the large half-circle. It follows from condition (ii) that G has at 
most one pole at infinity. We have three cases: G(s) may go to zero; it may go 
to a constant, which must be positive because of condition (iii); or it may go 
to infinity as ks, where the constant k must be positive because of condition 
(ii). We can thus conclude that |F(s)| > 1 on T. Since F is analytic in D, the 
condition then also holds on D, and Eq. (5.45) then follows. Notice that it also 
follows that the function G(s) does not have any zeros inside D, D 


We now illustrate the different passivity concepts on linear time-invariant 
systems. 


EXAMPLE 5.12 Linear time-invariant systems 


Consider a linear time-invariant system with the transfer function G(s). As- 
sume that G(s) has no poles in the closed right half-plane. It follows from 
Parseval’s theorem that 


oo oo 


Ge [rou oe = J Yliw)U (ia) do 
= a | G(ia)U (ia) U(-—iw) do 


=z f" {G(io)} U (iw) U (io) do (5.47) 


ale 
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where Y and U are the Laplace transforms of y and u, respectively. If G(i@) 
is positive real (see Definition 5.5), we have Re G(i@) > 0, and we get 
Ql) 20 
which shows that the system is passive. It follows from Definition 5.9 that a 
positive real transfer function is input strictly passive if 
Re G(iw) > e> 0 
and output strictly passive if 
Re G(iw) > e|G(ia)|* 


The transfer function G(s) = s + 1 is thus SPR and ISP but not OSP, The 
transfer function G(s} = 1/(s + 1) is SPR and OSP but not ISP. The transfer 
function 
y= 2th 
~ (s+1)/? 
is OSP but not ISP and SPR. o 


In control systems applications it is common for transfer functions to be 
proper or strictly proper. The output strict passivity is therefore the concept 
that is normally used in these applications. 


Proof of the Kalman-Yakubovich Lemma 


Having developed the notion of SPR, we can now give a proof of the Kalman- 
Yakubovic lemma, which was given as Lemma 5.2 in Section 5.5. Consider the 
linear system 


dx 

— = Ax+B 

dt tBu (5.48) 
y = Cx 


which is assumed to be completely controllable and completely observable. The 
system has the transfer function 


G(s) = C(sI — A) 1B (5.49) 


We will prove that a necessary and sufficient condition for G(s) to be SPR is 
that there exist positive definite matrices P and Q such that. 


ATP + PA= —Q (5.50) 


and 
B'P=C (5.51) 


We will first prove necessity. If we use V = xf Px as a Lyapunov function, it 
follows from Theorem 5.1 that the system (5.48) is stable. This implies that 
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the transfer function G(s) is analytic in the closed right half-plane. To prove 
that G(s) is SPR, it remains to verify condition (iii) in Theorem 5.7. It follows 
from Eq. (5.50) that 


—sP — ATP + sP — PA = (—sI — A)T P + P(sI—A) =@Q 
To obtain this equation, we have added and subtracted sP. Multiplying the 
equation with B?(—sI — A)T from the left and (sI — A)~!B from the right 
gives 
B? P(sI—A)"'B+B? (~sl—A)" TPB = B?(—sI—A)-? Q(sI—A)!B (5.52) 
Since GT (—s) = G(—s), Eq. (5.49) now implies that 
2Re G(iw) = G(iw) + G(-iw) = B? (—iwI — A)-*Q(iwI — A)"'B > 0 


It now follows from Theorem 5.7 that G(s) is PR. Replacing s by s — € in the 
above calculations, we find in a similar way that 


Re G(iw — £) > 0 


Since the matrix A has ali its eigenvalues in the open left half-piane, it follows 
that the matrix A + €T is also stable. It now follows from Theorem 5.7 that 
G(s) is SPR. 

To prove sufficiency, we start with the assumption that the system (5.48) 
has a transfer function G(s) that is SPR. The proof is based on a direct 
construction of the matrices P and Q. Consider the expression 


_ B(s) | B(-s)_ A(~s)B(s) + A(s)B(—s) — Qs) 
Gle) + 6-8) = Sie + Ae) AAC) T AOAC) 


where 

Q(s) = gi (1) 128 4 + Qe Gye Fetih 
Notice that polynomial Q (s) has only terms of even power and that all coef- 
ficients q; are positive, since G(s) is SPR. Let Q be a diagonal matrix with 
elements g;. Introduce the following realization of the transfer function: 


—G, —fg -> —GQn_1 Ër 1 
dx 1 0 0 0 0 
a = . x+ H 
0 0 «:: 1 0 0 
With this choice we have 
grt 
1 gh 2 


(sI —A)'B= 


nE (5.53) 
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and 
BT (—sI — A) Q(sI — A)'B = Wee = G(s) + G(—s) (5.54) 


Since G(s) is SPR, the matrix A has no eigenvalues in the right half-plane 
or on the imaginary axis. Let P be the solution to Eq. (5.50). This matrix is 
positive definite because @ is positive definite and A has all its eigenvalues 
in the left half-plane. Furthermore, let Č = BTP. We now show that C = C. 
Since P is the solution to Eq. (5.50), it follows that 


C(sf — AJB + B?(-sI — A)T ĜT = BT (—sI — A)-T Q(sI — A)“ B 


But according to Eq. (5.54) the right-hand side is equal to G(s) + G(—s). Since 
a partial fraction expansion is unique, it follows from Eq. (5.52) that 


G(s) = C(sI — A)"'B = C(sI — A) 1B 
which implies that C = C, and the theorem is proven. O 


Test for Positive Realness 


It is useful to have an algorithm to test whether a function is positive real. 
Theorem 5.7 can be used for this purpose. Condition (i) is easily tested by an 
ordinary Routh-Hurwitz test. Condition (ii) is a straightforward calculation. 
To test condition (iii), we proceed as follows: 


oo- 2 


then 

Biiw) _—_ B(im)A(—ia) 

Alio) ~~ A(iw)A(—ia) 

Since the denominator is nonnegative and G(i@) is symmetric with respect to 
the real axis, it suffices to investigate whether the function 


Re G(i@) = Re 


f(@) = Re (Bia) A(—ia)) 


is nonnegative for œ > 0. Notice that f is an even function of w. It is thus 
sufficient to investigate whether f(@) has any real zeros. This can be verified 
directly by solving the equation f{@) = 0. There is also an indirect procedure. 
To describe this, introduce the polynomial 


a(x) = f (Vx) 


The problem is thus to find whether the polynomial g(x) has any zeros on the 
interval (0,00). This classical problem can be solved as follows: 
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1. Let gi{x) = g(x), g2(x) = g'(x). Form a sequence of functions {g(x}, 
go(x),.--,9n(x)}} by letting —g,42(x) be the remainder when dividing 9; (x) 
by grii(x). Proceed until gn is a constant. 

2. Let V(x) be the number of sign changes in the sequence {gi(x), ga(x),..., 
9n(x)}. 

3. The number of real zeros of the function g(x) in the interval [a, b] is then 
V(a) — V(b) 

The function sequence {gi(x), ga(x),...,9n(x)} is called a Sturm sequence. The 
procedure is illustrated by an example. 


EXAMPLE 5.13 Second-order system 
Consider the transfer function 


s +63 +8 
G(s) = s +4343 


First notice that G has no poles in the right half-plane. Furthermore, 
f(@) = Re ((—w? + Gio + 8)(—w? — 4iw + 3)) = wt + 130? + 24 


Hence 

g(x) = x? + 13x + 24 
We get 

gi(x) = X + 18x + 24 

g2(x) = 2x +13 

73 

93(x) = 4 
Since V(0) = 0, V(co) = 0, g(x} has no zeros on the positive real axis. The 
transfer function G(s) is then SPR. oO 
An Alternative Test 


An alternative test for SPR for a system with a proper transfer function can be 
obtained from the proof of the Kalman-Yakubovich lemma. Write the matrix A 
in controllable canonical form. Solve the equations 


qg 0 --- 0 
G2 0 

ATP + PA= 
O O -> Q 

BTP=C 


where P is a symmetric matrix. This gives n + n(n + 1}/2 equations for the 
unknown elements of P and Q. The transfer function is SPR if g; > Q for 


i= 1,...,7. 
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The Passivity Theorem 
Having established a notion of passivity, we can now state a key result. 


THEOREM 5.8 The passivity theorem 


Consider a system obtained by connecting two systems Hı and Ho in a feedback 
loop as in Fig. 5.16. Let Hı be strictly output passive and Hs be passive. The 
closed-loop system is then BIBO stable. 


Proof: Since Hi is strictly output passive, we have 


(ye) > dilly? 


Since e = u — Hoy, we have 


df? < (le) = (ju — Hay) = (y|u) — (y| Hoy) (5.55) 


Since H; is passive, we have 


(y| Hey) > 0 
and it then follows from Eq. (5.55) that 


Sly? < (yle) < liy lel 
where the last inequality follows from Schwartz inequality. We now get 


1 
< — 
lyi < = lell 
which proves the result. oO 


Remark. The passivity theorem may also be regarded as an extension of 
Nyquist’s stability theorem. Instability is avoided by having a loop transfer 
function with a phase lag less than 180°. Oo 


Relations between Passivity and Small Gain Theorems 


The small gain theorem (Theorem 5.6) and the passivity theorem (Theo- 
rem 5.8) are closely related. To investigate this connection further, we consider 
signal spaces that are inner product spaces and we show that the small gain 
theorem can be derived from the passivity theorem. We start with Fig. 5.16 
and make a sequence of transformations of the feedback loop that are shown 
in Fig. 5.17. 

Consider the closed-loop system in Fig. 5.17(a). Assume that the system 
H, is strictly output passive and that H; is passive. In Fig. 5.17(b) we have 
introduced two loops that cancel each other. The input-output relations of the 
encircled loops are (J + Hı) Hı and I — Ho, respectively. These two systems 
are shown in Fig. 5.17(c), where we have also added two leops and two gains 
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(1/2 and 2) that cancel each other. The transfer functions of the encircled loops 
are 


Sı =2(H, +21 H, — (Hı 4+: 2714+ D) = (Hi + D(A, - 2) 
and 
1 i E 
Sa = — I— 5 (1 — He) z- He) = (He + 1) (H-I) 
The system obtained after the transformations is shown in Fig. 5.17(d). 
The systems in Fig. 5.17(a) and Fig. 5.17(d) are equivalent. We use their 


equivalence to prove the result. First we observe that if the system (H + IJ! 
exists, it commutes with H. To prove this, use the identity 


H + B? =H(H+1)=(H+1DH 
and multiply from the left and the right by (J + HY}; then 
(H+ 17H = H(H +1)" 


a) 


Figure 5.17 Four equivalent systems. 
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Subtracting (H + IJ! from both sides gives 
(H + DH — I) =(H—I{H +I! 
Consider the systems S and H related through 
S= (H-H +N = (H +H -D) 
The input-output relation for the system S is 
y= Su = (H — I)(H + Iy*u 


Introduce 
x= (H + I)u 
We find that 
y= (H—Dx 
u = (H + I)x 
Hence 


Iyl? = (yl) = (Hx — x|Hx — x) = (Hx|Hx) + (|x) — 2(Hx|x) 
Similarly, we find that 
lul? = uju) = (Hx + x|Hx + x) = (H2x|Hx) + (x|x) + 2(Hx|x) 


Hence 
loll? = hul? — 48 xix} (5.56) 


If H is passive, we have (Hx\x) > 0; hence yj} < lull, which implies that 
y(S) < 1. Similarly, we find that y(S) < 1 if H is strictly output passive. 
It follows from Eq. (5.56) that 


ul — ijy? llel — || Sul]? u|? 
(Haja) = WIE = DAP _ el? = ISe 04 egy bel 
4 4 4 

This implies that H is passive if y(S) < 1 and strictly output passive if 
y(S) <1. 

Notice that the argument. would be the same if § and H were complex num- 
bers. The result is an example of the equivalence between complex numbers 
and operators on inner product spaces. 


5.7 APPLICATIONS TO ADAPTIVE CONTROL 


The results from input-output stability theory are now used to construct ad- 
justment rules for adaptive systems. So that we can focus on the principles and 
avoid unnecessary details, only the problem of adjusting a feedforward gain is 
considered in this section. 
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Consider a system with transfer function kG(s) where G(s) is known 
and k is an unknown constant. We will determine an adaptive feedforward 
compensation so that the transfer function from command signal to output 
is koG(s). This problem was previously considered in Examples 5.1 and 5.8. A 
parameter adjustment law was also derived for the problem in Section 5.5 using 
Lyapunov theory. This control law can be represented by the block diagram in 
Fig. 5.14(b). According to Theorem 5.5 the adaptive system will be stable if 
the transfer function G(s) is SPR. This condition indicates that the result is 
related to passivity theory. To establish this, we redraw the block diagram as 
in Fig. 5.18, which gives a configuration in which the passivity theorem can 
be applied. To use the passivity theorem, we must investigate the properties 
of the dashed block in Fig. 5.18. We have the following lemma. 


LEMMA 53 Property of positive real systems 


Let r be a bounded function, and let G(s) be a transfer function that is positive 
real. The system whose input-output relation is given by 


y=r(G(p)ru) 
is then passive. 
Proof: It follows that 


oo 


olu = | sul) ae = | (utr) (GP)ru)(e) de 
0 


0 
= f wE) (Gpo) (2) dr = (w| Gu) 
0 


Figure 5.18 Representation of the system with adjustable feedforward gain 
when using the control law of Eq. 5.40. Compare with Fig. 5.14(b). 
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Figure 5.19 A stable parameter adjustment law is obtained if GG, is SPR. 


where w = ru is in Le since u is in Le. Since G(s) is positive real, it follows 
from Example 5.12 that (w|Gw) > 0, which proves the result. oO 


By invoking the passivity theorem (Theorem 5.8) we can now obtain an 
alternative proof of Theorem 5.5. Figure 5.18 shows that the model-reference 
system can be viewed as a feedback connection of two systems. One system is 
linear with the transfer function G. It has the signal (@—6°)u, as the input and 
the model error as the output. The other system has the model error e as the 
input and the quantity —(@—6°)u, as the output. Since an integrator is positive 
real, it follows from Lemma 5.3 that the system H is passive. If the transfer 
function G is proper and strictly positive real, it follows from Example 5.12 
that G(s} is output strictly proper. The passivity theorem (Theorem 5.8) then 
implies that the closed-loop system is BIBO stable. In Fig. 5.18 there are no 
external inputs, as in Fig. 5.16. The system in Fig. 5.18 may have initial 
conditions, however, because the process and the model may have different 
initial conditions. The integrator may also have an initial condition that can 
be thought of as being generated by an external input signal. Such an input 
signal can always be chosen to be zero for t > 0. We thus have a situation 
covered by Theorem 5.6, where the input signal u is bounded in Lg. The error 
e(t) goes to zero as ¢ goes to infinity. Notice that the MRAS is stable for all 
values of y > 0 when the SPR condition is satisfied. This implies that the 
adaptation can be made arbitrarily fast. 


Design of Stable Adjustment Mechanisms 


The passivity theorem gives a convenient way to construct stable adjustment 
laws. We simply try to introduce some compensating network so that the 
transfer function relating the error to (8 — @°)u, is strictly positive real, as 
is illustrated in Fig. 5.19. For systems with output feedback, the problem is to 
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find a compensator G, such that the transfer function GG, is strictly positive 
real. This can be done by using the Kalman-Yakubovich lemma (Lemma 5.2). 
With pure feedforward control it is natural to assume that G is stable. It can 
then be written as 

B(s) 

A(s) 

where A(s) has all its zeros in the left half-plane. For a stable polynomial A(s) 
a polynomial C(s} such that C(s}/A(s) is SPR can always be found. To do this, 
we introduce the following canonical realization of 1/A(s): 


G(s) = 


—&1 —a2 t.. —in-l —n 1 
dx 1 0 0 0 0 
a = . x+]. |u 
0 0 1 0 0 
Choose a symmetric positive definite matrix Q and solve the equation 
ATP + PA=-@ 


The coefficients of a C polynomial such that C(s)/A(s) is SPR are then the 
first row of the P matrix. 

The polynomial C(s) will have a degree that is at most equal to deg A — 1. 
For systems with stable zeros and pole excess 1 it is thus possible to find a 
stable adjustment rule by choosing G,{s) = C(s)/B(s). However, for systems 
with higher pole excess than 1 the compensator required to make GG, strictly 
positive real will contain derivatives. We will show how to deal with the case 
in which the pole excess is higher by introducing the augmented error. 


PI Adjustments 


All adjustment laws discussed so far have been integral controllers. That is, 
the parameter has always been obtained as the output of an integrator. There 
are, of course, many other possibilities for choosing the adaptation mechanism 
H in Fig. 5.18. For instance, it can be expected that quicker adaptation can 
be achieved by using a proportional and integral adjustment law. This means 
that the control law of Eq. (5.40) is replaced by 


t 
O(t) = yue (telt) — y2 J ue(r)e(t) dr (5.57) 
Since a system with the transfer function 


H(s) = y1 + Y2/s 


is output strictly passive for positive yı and ya, it follows from Theorem 5.8 
(the passivity theorem) that Eq. (5.57} gives a stable adjustment law if GG, 
is positive real. 
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The Augmented Error 


Some progress has now been made to construct stable parameter adjustment 
rules for the problem of adjusting a feedforward gain. Passivity theory gave 
good insight and led to the idea of filtering the model error so that GG, is 
SPR. However, we have not solved the problem in which G has a pole excess 
larger than 1. To do this, we first factor the transfer function G as 


G = GGe 


where the transfer function G1 is SPR. The error e = y—7y,, can then be written 
as 
e = G(0— 6° )u. = (G1G2)(0 — 0°)uc 


= G1 (Go(@ — @°)u, + (8 — O°) Gru, — (8 — 0°}Goue) 
= G1 ((0 — 0°)Geu.) — Gi ((8 — 0°) Gou, — G2(0 — 6°)ue) 
Introduce the augmented error € defined by 
E=e+n 
where 77 is the error augmentation defined by 
n = G1(6 — 6°)Gou, — G(@ — 6l Jue 
= G\(8Geu,) — Géu, 


The second equality follows because G6°u = 6°Gu when @° is constant. The 
augmented error is thus obtained by adding a correction term 77 to the error. 
The correction term vanishes when the parameter @ is constant. It follows that 


€ = Gi ((8 — 8°) Gou,) = Gi(8 — 6°)z, (5.58) 


where i, is the reference signal filtered through Gz. Equation (5.58) is an error 
model similar to the ones used previously, and we have the following theorem. 


THEOREM 59 Stability using augmented error 


Consider a model-reference system for adaptation of a feedforward gain for a 
system with the transfer function G. Let GGe be a factorization of G such 
that G, is SPR. The parameter adjustment law 


de 
Pria ~y E (Gous) (5.59) 
where 
€ = e + Gi(@Gou,) — G (Oue) (5.60) 


gives a closed-loop system in which the error goes to zero as £ goes to infinity. 
Proof: Since G1 is SPR, the discussion of the error model shows that £ € La. 
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Model 


Figure 5.20 Block diagram of a model-reference adaptive system based on 
the augmented error. 


Remark 1. The trivial factorization with Gi = 1 is one possibility. 


Remark 2. If the input signal is persistently exciting, it can be shown that 
the parameters also converge. 


Remark 3. Notice that Gp must be minimum phase to establish that @ con- 
verges to 6°. The reason is that we have to go “backwards” through Ga to show 
that @ — 6° goes to zero if the output e goes to zero. That is, the inverse of Ge 
must be stable. This is a condition that will be seen again in the general case 
in Section 5.8. o 


A block diagram of the system with augmented error is shown in Fig. 5.20. 
To implement the augmented error, it is necessary to introduce realizations 
of the transfer functions G, and Gz. The augmented error was introduced by 
Monopoli. It was a key idea for adaptive control systems having pole excess 
larger than 1. Application of the idea to general linear systems is discussed in 
Section 5.8. In Section 5.9 we show that the augmented error appears naturally 
in the self-tuning regulator. 


Summary 


The problem of adjusting the gain in a known system has been used to intro- 
duce some ideas in the design of stable model-reference adaptive systems. It 
was first shown that adjustment rules could be obtained for systems in which 
the plant is strictly positive real, The parameter adjustment rules were similar 
to those obtained by the gradient method. 

The class of systems could then be extended by using adjustment rules 
in which the error is filtered. In this way the problem can be solved for sta- 
ble minimum-phase systems that have pole excess less than 1. The idea of 
augmented error was introduced to solve the problem of higher pole excess. 
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5.8 OUTPUT FEEDBACK 


We now derive an MRAS for adjusting the parameters of a controller based 
on output feedback in a fairly general case. A process with one input and one 
output is considered. It is assumed that the dynamics are linear and that the 
control problem is formulated as model-following. The key assumption is that 
the controller can be parameterized in such a way that the error is linear in 
the controller parameters. The derivation of the MRAS is described as follows: 


Step 1: Find a controller structure that admits perfect output tracking. 


Step 2: Derive an error model of the form 


e = Gy(p) {97 (t)(6° — 8)} (5.61) 


where G is a strictly positive real transfer function, 6° is the process pa- 
rameters, and @ is the controller parameters. The right-hand side should be 
expressed in computable quantities. 


Step 3: Use the parameter adjustment law 


dé 
ae PE (5.62) 
or the normalized law de 
ao Z, PE 
dt" at gto (5.63) 


Notice that the error £ in Eq. (5.61) is linear in the parameters, a condition 
that imposes restrictions on the models and controllers that can be dealt with. 
A model of the form (5.61) is typically obtained by algebraic manipulations, 
filtering, and error augmentation. 

We now show one way to apply the design procedure. 


Finding a Controller Structure 


The first step in the design procedure is to find a suitable controller structure. 
The tools for doing this were developed in Section 3.2. Let the process be 
described by the continuous-time model 


Ay(t) = bp Bu(t) (5.64) 


where it is assumed that the polynomials A and B do not have common factors 
and the polynomial B is monic and assumed to have all its zeros in the left 
half-plane. Furthermore, the polynomial is normalized so that B is monic. 
The variable bọ is called the instantaneous gain or the high-frequency gain. A 
general linear controller can be written as 


Ru(t) = —Sy(t) + Tu, (£) (5.65) 
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where u, is the command signal. Since the polynomial B is stable, the corre- 
sponding poles can be canceled by the controller. This corresponds to R = RB. 
The closed-loop system obtained when the controller is applied to the process 
(5.64) is described by 

(AR, + boS)y = bo Tu, (5.66) 


If polynomial T is chosen to be T = toA,, where A, is a stable monic polynomial 
and FR, and S satisfy 


AR, + bos = A, Ap, (5.67) 
it is possible to achieve perfect model-following with the model 


Amm(t) = Bototte(t) (5.68) 


The Error Model 
Having obtained a suitable controller structure, we now proceed to derive an 
error model. It follows from Eq. (5.67) that 

AjAmy = AR y+ boSy = Ri bo Bu + boy (5.69) 


where the first equality follows from Eq. (5.67) and the second from Eq. (5.64). 
Introduce the error 


e= Y— Ym 
It follows from Eqs. (5.69) and (5.68) that 
AoÁme = ApAm(Y — Ym) = bo (Ru + Sy — Tue) 


or 


This expression is not yet a suitable error model, because the transfer function 
bo/{AvAm) is not SPR. Therefore introduce the filtered error 


e= Se= S (yam) 


where Q is a polynomial whose degree is not greater than deg A,A,, such that 


bo@ 
AvAm 


is SPR. The filtered error can be written as 


(5.70) 
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Let P = P| Po, where Ps is a stable monic polynomial] of the same degree as R. 
Rewrite R/P as 

RR R-Pit Po 1 R ~ Po: 

P PR R +? 
The filtered error then becomes 

_ boQ 1 R — P; S T 
ef= AAn (et P u+59- 5u) 

Let k, L, and m be the degrees of the polynomials R, S, and T, respectively. 
Introduce a vector of true controller parameters 


T 


0? = (r4 ...r4 S0. -Sz to...tm) (5.71) 


where r; are the coefficients of the polynomial R — Pz. Also introduce a vector 
of filtered input, output, and command signals 


pe} 1 p“ 1 p™ 1 ) 
T— een 2.. 1 — 2- Z — yy Up s.. — >n He 
? (Fa “Boy” PUPO? PQ)" Pp)” 
(5.72) 
The filtered error can then be written as 
— boQ i T g0 
er = A (pute e) (5.73) 


To obtain an error model, we must introduce a parameterization of the con- 
troller. In the nominal case in which the parameters are known, the control 
law can be expressed. as 
u = —P;(9" 6°) = —P; (6°) p) = —(0°)" (Pig) (5.74) 
where Pi is a polynomial in the differential operator. Let @ denote the ad- 
justable controller parameters. The feedback law 
u = —Pi (p70) 


would give the desired error model. However, this control law is not realizable if 
P, has a degree greater than 1 because the term Pi(pT 8) contains derivatives 
of the parameters. However, the control law 


u = ~—6" (Pig) (5.75) 


is realizable because of Eq. (5.69). If we use this control law, it follows from 
Eq. (5.70) that the filtered error can be written as 


b 1 

ef = we. (ore -7 87 (Pag)) 
_ WR 
~ AgAm 


(ote — 978 — = 6” (Pip) + 70) 
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Introduce the signals 7 and £, defined by 


1 1 | 
1= 5 67 (Pig) — 976 =-— (5 u + ge) 


Pi 
bo@ y bo@ 
AvAn | Aom | 
The signal £ is called the augmented error, and 7 is called the error augmen- 
tation. The augmented error is computed as follows: 


bg 
AoAm ” 
With the chosen degrees of P and Q it is straightforward to verify that the 
computation does not require taking derivatives of the signals y, t, us and Ym- 
The error model of Eq. (5.76) is also linear in the parameters, and the transfer 
function boQ /(AsAm) is SPR. The error model thus satisfies the requirements 
of Step 2, and the parameters can then be updated by Eq. (5.62) or Eq. (5.63). 
So far, the derivation has been done along the lines developed in Sections 5.3 
and 5.4. However, to show the stability of the closed-loop system, it is not 
sufficient that the system (5.70) is SPR. It is also necessary that the signals in 
g are bounded. This condition can be difficult to show. Furthermore, Eqs. (5.76) 
are valid only if the control signal is generated from Eq. (5.75). This implies, for 
instance, that the control signal cannot be saturated. Notice that it is necessary 
to know the parameter by to compute the augmented error £, 

The derived algorithm thus requires that the high-frequency gain bo be 
known. If the parameter is not known, it can be estimated as follows. The 
error model of Eq. (5.73) can be written as 


(5.76) 


&=erf + p7 (8° —8) 


E= © (Y yn) + (5.77) 


e = bo (97 O° + us) (5.78) 
where 
_ Q 
_ R 
“= AAP 


A simple gradient estimator for 6) and 6° is then given by 


dé rs 
u Y DopfEp = YPfEp 


2 (5.79) 
aby — To 
“at =y (978 + up) Ep 
where £p is the prediction error 
Ep = & — & = e — bo (070 +up) (5.80) 


Notice that 6) can be absorbed in the adaptation gain if its sign is known. 
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Realization 
The equations needed to implement the general MRAS can now be summarized: 
Bn, 
Ym = A, c 
ep = Se= 2 (yn) 
1 T 
7=—- (5 ute a) 
bQ 
E= ert LA. 7 
dO pye 
dt — Yp 
u = —0" (Pig) 


A block diagram of the model-reference adaptive system is shown in Fig. 5.21. 
The block labeled “Filter” in Fig. 5.21 is a linear system that generates Pig 
and p from the signals ue, u, and y. The vector g is composed of three parts 
having the same structure. It therefore suffices to discuss one part. Consider, 
for example, how to generate ¢,, and PỌ, where 


and 


where P = P,P. and k = deg R = deg Po. 
Let the polynomials F; and Pz be 
Pi = p+ ape +--+ ay 
Po = p+ Bip" +---+ By 


We also assume that deg Pı > deg Pe. The vectors x and ¢, can then be realized 
as follows: 


ped 


—B, —Bo ... —Brr —Pr 


— = x u 
dt ; a, tj, 


> 
5S 
Leal 
So 
2 


Bu 
nN 
p= 
5 
© 
oS 
o 


—_—= z+ x 
dt Dot :) 
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Model 


Figure 5.21 Block diagram of a model-reference adaptive system for a SISO 
system. 


where x, = 1/P2-u is the last element of the x vector. The elements of pu 
are the & last elements of the state vector z. Furthermore, 1/P1-u can also be 
obtained from the generation of pu and Pipu. To generate the full vectors g 
and PỌ, we thus need three realizations of the transfer functions Py and P. 
The block labeled “Filter” in Fig. 5.21 represents these systems. 


Design Parameters 


Several parameters must be chosen in the design procedure: 
» The model transfer function B,,/Am, 
« The observer polynomial A,, 
» The degrees of polynomials R, S, and T, and 
e The polynomials P,, P2, and Q. 
Many different model-reference adaptive systems can be obtained by different 


choices of the design parameters. A popular choice of the polynomials is Py = 
Amn, P2 = Ao, and Q = AcAm. 


A Priori Knowledge 


To apply the MRAS procedure, the plant must be minimum-phase and the 
following prior information must also be known: 


» The sign of the instantaneous gain bo, 
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Figure 5.22 Simulation of the system in Example 5.14. (a} The process 


output (solid line) and the model output (dashed line}. (b} The control signal. 
(c) The error e = y— Ym- 


» The pole excess of the plant, and 
« The order of the plant or the controller complexity. 


EXAMPLE 514 Second-order MRAS 


The performance of the general MRAS is illustrated by a second-order example, 
given the system 


k 
a s(s + a) 
and the model r 
Bm _ @ 
Smale) = An  s%+2las + w? 


The polynomials A,, R, S, and T can be chosen to be 


A,(s} = s + 0o 
R(s)=s+ri 
S(s) = sos + 51 
T(s) = tos + ty 
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0 50 100 150 
Time 


Figure 5.23 The controller parameters in the simulation of the system in 
Example 5,14. 


The Diophantine equation (Eq. 5.67) gives the solution 


rı = 26a+a,-—a 
so = (26a, + w —arı)/k 


fo = ofk 


For simplicity we choose 
Q (s) = Ao(s)An(s) 
Pi(s} = Am{s} 
Po(s) = Ao(s) 


Figure 5.22 shows a simulation of the system with y = 1, ¢ = 0.7,@ = 1, a, = 2, 
a = 1, and k = 2. In the simulation it is assumed that 5) = bo. The used values 
of the filters Pi, P2, Q, and A, give a fairly rapid convergence of y to Ym. The 
parameter estimates at the end of the simulation are still far from the optimal 
values, but the error is small (see Fig. 5.22c). The controller parameters are 
shown in Fig. 5.23. The control law at ¢ = 150 gives a closed-loop system with 
a pole in —1.95 and two complex poles corresponding to @ = 0.84 and ¢ = 0.78, 
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which should be compared to the roots of A,A,,, which are in —2, and complex 
poles corresponding to w = 1 and ¢ = 0.7. o 


5.9 RELATIONS BETWEEN MRAS AND STR 


For a long time, model-reference adaptive systems and self-tuning regulators 
were regarded as two quite different approaches to adaptive control. In this 
section we will show that the methods are closely related. The key observation 
is that the direct self-tuner in which process zeros are canceled (Algorithm 3.3) 
can be interpreted as a MRAS. 

An MRAS for a genera! continuous-time linear system was derived in 
Section 5.8. In the derivation it was assumed that the process was minimum- 
phase and that all its zeros were canceled in the design. We showed that the 
adjustment law for updating the parameters can be written as 


d8 
Jr Vere (5.81) 


where Øp is a filtered regression vector and £ is the augmented error given by 
Eq. (5.77), that is, 
=p 8 y- Ym) +A ey (5.82) 
Now consider a discrete-time direct self tuner. When all process zeros are 
canceled, polynomial BT is a constant and we get the process model 
y(t) = pF (t— do) @ 


In the direct algorithm the estimated parameters are equal to the controller 
parameters. The least-squares method can be used for the estimation by using 
the residual 


E(t) = y(t) — H(t) = y(t) — oF (t — do) Ot — 1) 


The parameter update can be written as 


6(t) = 6(¢ —1) + Pitot (t — do} e(t) (5.83) 
The residual is given by 
E(t) = y(t} — I(E} = y(t) — y(t) + Ym (t) — 9 = et) +) (5.84) 


A comparison of Eqs. (5.81) and (5.83) show that Eq. (5.83) can be interpreted 
as a discrete-time version of Eq. (5.81). Notice that the gain y in the MRAS 
is replaced by the matrix P(t}. This matrix changes the gradient direction pf 
and gives an appropriate step length. Also notice that it follows from Eq. (5.84) 
that the error augmentation is simply y—¥. The augmented error that required 
a significant ingenuity to derive in the MRAS context is thus obtained directly 
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from the least-squares equations in the STR. More filtering is required in the 
MRAS because of the continuous time formulation. Notice that it follows from 
Eq. (5.83) that 
p7 (t — do) = — grade £(t) 

The vector p? (t — do) can be interpreted as the sensitivity derivative of the 
prediction error £ with respect to the parameter. The parameter update of 
Eq. (5.83) is thus a discrete-time version of the MIT rule. The main difference 
is that the model error e(t) = y(t) — yp,(t) is replaced by the prediction error 
e(t). 

Notice that in the identification-based schemes such as self-tuning con- 
trollers we normally attempt to obtain a form similar to 


y(t) = 976 
With the model-reference approach, it is also possible to admit a model of the 


form 

y(t) = G(p) (p78) 
where G(p) is SPR. In summary we thus find that the MRAS-type algorithms 
can be obtained in a straightforward way as a direct self-tuning regulator based 
on a minimum-degree pole placement design with cancellation of the whole B 
polynomial. 


5.10 NONLINEAR SYSTEMS 


The Lyapunov method can also be used to find adaptive control laws for non- 
linear systems. This is a difficult problem because no general design methods 
are available. There is, however, much interest in adaptive control of nonlinear 
systems. For this reason we present some of the current ideas and illustrate 
them by a few examples. 


Feedback Linearization 


Before attempting to do adaptive control, we must first have a design method 
for the case in which the parameters are known. Feedback linearization is a 
design method that is similar in spirit to pole placement. It can be applied to 
certain classes of systems. We illustrate it through an example. 


EXAMPLE5.15 Feedback linearization 
Consider the system 


d 
T =t Flo) 
dxa 


dt 
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where f is a differentiable function. The first step is to introduce new coordi- 
nates 


£1 = X1 
Èo = x2 + f(x) 
The equations then become 
dé, _ 
u Ša 
déz 


a = of (oi) +u 


By introducing the control law 


u = —a26) — argo — Éz f (G1) +v 
we get a linear closed-loop system described by 


a= (Wy m) (1) 


This system is linear with the characteristic equation 
s+ &as + a = Ô 


By transforming back to the original coordinates the control law can be written 
as 


u = =f] — (a1 + F) (x2 + f(x1)) +0 O 


The closed-loop system obtained in the example will behave like a linear 
system. This is the reason why the method is called feedback linearization. 
The system in Example 5.15 is quite special. Applying the same procedure for 
a system described by 

T = f(x) +u9(x) 
we first pick 
$1 = h(x) 


as a new state variable. The time derivative of £, is 


Ht = n(x) (F(x) + uala) 


If h'(x}g(x) = 0, we introduce the new state variable 
čz = h'(x) f(x) 


We proceed as long as the control variable u does not appear explicitly on the 
right-hand. side. In this way we obtain the state variables €,...¢,, which are 
combined to the vector č € R”, where r < n. We also introduce the new state 
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variable 71 ...7n-+, which are combined into the vector 7 € R”. This can be 
done in many different ways. We obtain the following system of equations: 


He 
dëz 
m 7S 
: (5.85) 
dé, 


dt = a (En) + up (¢.77) 


fy (En) 


Notice that the state variables € represents a chain of r integrators, where the 
integer r is the nonlinear equivalence of pole excess. The variables 7 will not 
appear ifr = n. This case corresponds to a system without zeros. This actually 
occurs in Example 5.15, where r = n = 2. 

A design procedure, which is the nonlinear analog of pole placement, can 
be constructed if 6(€,77) # 0. If this is the case, we can introduce the feedback 
law 


"= TET (0,81 — a,_162 —.-.—ay€, — (6,7) + bou) 


The closed-loop system then becomes 


0 1 0 n. O 0 
dé 0 0 1 wee 0 
a : E+ E |? 
. (5.86) 
- may bg 


—ür —G@r-1 Or 
dn _ 
dt = ¥ (6,7) 


The relation between v and £ is given by a linear dynamical system with the 
transfer function _ 
Ei(s) _ bo 


V(s) sS + asl +H... ar 


This differential equation has a triangular structure. The part corresponding 
to the state vector Ë is a linear system that is decoupled from the variable 7. 
If £ = 0, the behavior of the system (5.86) is governed by 


G{s) = 


a (0,1) (5.87) 


This equation represents the zero dynamics. It is necessary for this system to 
be stable if the proposed control design is going to work. For linear systems 
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the zero dynamics are the dynamics associated with the zeros of the transfer 
function. Feedback linearization is the nonlinear analog of pole placement with 
cancellation of all process zeros. 


Adaptive Feedback Linearization 


We now show how feedback linearization can be extended to deal with the 
situation in which the process model has unknown parameters. The approach 
will be similar to the idea used to derive model-reference adaptive controllers. 
Let us start with an example that is an adaptive version of Example 5.15. 


EXAMPLE'5.16 Adaptive feedback linearization 
Consider the system ae 
“ay = x2 + OF (ex) 


where @ is an unknown parameter and f is a known differentiable function. 
Applying the certainty equivalence principle gives the following control law: 


u = —azx, ~ (a1 + OF'(x1)) (x2 + OF (x1)) +v (5.88) 


Introducing this into the system equations gives an error equation that is non- 
linear in the parameter error. This makes it very difficult to find a parameter 
adjustment law that gives a stable system. Therefore it is necessary to use 
another approach. 

Proceeding as in Example 5.15 and introducing the new coordinates 


1 = x3 
2 = x2 + 6 f (x1) 


where Ĝĝ is an estimate of @, we have 


oe = a = x2 + Of (x1) = 2 + (8 — ô) f(A) 


d 
De _ DO (xs) + (x2 + OF (21) f(a) + 4 
Choosing the control = to be 


u = a6) — 0162 — Ê(x2 + Êf (x1)) f’(x1) — f1) a +v (5.89) 


alaaa a BE 


we get 
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A comparison with the certainty equivalence control law given by Eq. (5.88) 
shows that the major difference is the presence of the term dé/ dt in Eq. (5.89). 

In analogy with the model-reference adaptive system, let us assume that 
it is desired to have a system in which the transfer function from command 


signal to output has the transfer function 

a2 

G(s) = ————_ 
(s) s* + a18 + a2 


Introduce the following realization of the transfer function: 


anf z] A(R) 
dt ~ | -a -a )™ az) 


and let e = Ë — xm be the error vector. If we choose 
U = Agim (5.90) 
we find that the error equation becomes 


de ( Jef f(g) 


dt = ôfrin ) Ae + BP 


—a2 —aı 
where 
i) z- (ale 
—az —aı Off (E) 
The matrix A has all eigenvalues in the left half-plane if a, > 0 and az > 0. It 
is then possible to find a matrix P such that 
ATP +PA=-—I 
Choosing the Lyapunov function 


Vine Pete 


we find that J 
V TUAT 2 .d6 
= e? (A7 P + PA)e + 28B7 
E =e ( + je +28B" Pe+ 7 6 de 
If the law for updating the parameters is chosen to be 
Jê 
a = 7B" Pe 
we find that _ ; 
dð d dê T 
ggl A=- = 7B Pe 
and the derivative of the Lyapunov function becomes 
dV T 
da eF 


This function is negative as long as any component of the error vector is 
different from zero. With the contro! law given by (5.89) and (5.90) the tracking 
error will thus always go to zero. 
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Backstepping 


Unfortunately, adaptive feedback linearization cannot be applied to all sys- 
tems that can be linearized by feedback. The reason is that higher derivatives 
of the parameter estimate will appear in the control law for systems of higher 
order. There is, however, another nonlinear design technique called backstep- 
ping that can be used. We first introduce this method and-later show how it 
can be used for adaptive control. In feedback linearization we introduced new 
state variables and a nonlinear feedback so that the equations describing the 
transformed variables had a particular structure. A similar idea is used in 
backstepping, but the transformed equations have a different form. To show 
the key ideas without too many technical complications, we consider a simple 
stabilization problem. To simplify the writing, we frequently drop the argu- 
ments of functions. 


EXAMPLE 5.17 Stabilization by backstepping 
Consider the system described by 


dx 
ap =r + f(a) 


Da aie, (5.91) 


Introduce £, = x1. Then 


a = x2 + f (1) = —Şı + x2 + 1 + FSi) 


If we introduce the function 
a(i) = 61 + f (Ex) 


and the state variable 
2 = x2 +01 (61) (5.92) 


the differential equation for €; can be written as 


d 
oe = —ğı +62 
The derivative of the variable Pe is given by 


ae ant +B aaa 


If we introduce the function 


d2(1,62) = 2 + ra (—¢1 + č2) 


dE, 1 (-E1 + E2) 
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and the state variable 
a = xg + ae(&1, &2) 


the differential equation for 2 can be written as 


oe =—¢2 +63 
Taking derivatives of €; and using Eqs. (5.91), we find that 
d3 4. 2a 


ho +i ($1 + $2) +2 BE, = (Ez + és) 


Introducing the function 


a3($1,$2,¢3) = Eq n E, = (£1 + &2) ĉe aE 2 (~£2 + ča) 
we find that the differential equation for 3 can be written as 


Be L g + aln éng) +u 


The feedback 
u = —as($1, 2 $3) 
gives the closed-loop system described by 


d -i1 1 0 
a=} o -1 ıfë (5.93) 
0 0 -1 


This system is clearly stable, and its state € goes to zero exponentially. Notice 
that by a slight modification of the procedure we can have any number in the 
diagonal of the system matrix. 

The transformation was obtained recursively. Notice that if the variable xz 
was a control variable that could be chosen freely, the “control law” 


x2 = —a (ğı) 


would give 


cm 


The state variable £2 defined by oe (5.92) can thus be interpreted as the 
difference between xz and the “stabilizing feedback” —a,(é,). 


Similarly, if x3 was a control variable that could be chosen freely, the 
“control law” 


X3 = —az (1, 2) 


5.10 Nonlinear Systems 251 


would give the closed-loop system 


"hh A =~—€,+é 
déz 
He =— é 


The state variable ¢; can be interpreted as the difference between xg and the 
“stabilizing feedback” —ag(&1, &2)- 

The procedure was originally derived by applying this reasoning recur- 
sively, and the name “backstepping” derives from this. 

In the example the system was transformed to a triangular form given 2 
Eq. (5.93). There are many other possibilities. 


Adaptive Backstepping 


The key idea of backstepping is to derive an error equation and to construct a 
control law and a parameter adjustment law such that the state of the error 
eguation goes to zero. The idea is illustrated by a simple example. 


EXAMPLE 5.18 Adaptive stabilization by backstepping 


Consider the system 


Mi 5 OFC) 


where f is a known function and @ an unknown parameter. We derive a control 
law that stabilizes the system when the parameter @ is unknown. Introduce a 
new state variable £; = xı. We write the derivative of £} as a sum of terms 
in which one of them depends on known quantities only. For this purpose we 
introduce the parameter estimate Ê and the error 6 = @ — @. The derivative of 
€, then becomes 


ai A = —E, +1 +x + Of(E:) + Öf (41) 


Introduce the next state variable £2 as 


Ez = x2 + alé, Ê) 


where 


a1(§1, ô) = 1 + ÔF ($1) (5.94) 
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The differential equation for €; can then be written as 


ao =$ +2 + Õf (5.95) 


We now proceed to rewrite the derivative of 2 as a sum of two terms in which 
the first depends only on £1, č2, and 6. Hence 


den _ ða dy Oa, dé 


dt of, dt ' a6 dt 


Equation (5.95) gives the desired separation of terms in dé,/dt. Some work is 
required to obtain a similar expression for d@/dt. We have 


déz day 


de St ag (oi toe + OF) + da, dO 


06 dt 
Following the idea of backstepping, we consider xg to be a control variable that 
can be chosen freely. The Lyapunov function 

2V = E 4 e724 6" 


can be used to find a control law and an adaptation law that stabilizes the 
error equation for variables č; and £». After some calculations we find that the 
derivative of V is given by 


(5.96) 


WY eh 4 Suda tas (Ga SA) a (artt 2 ree) - 2) 


The term containing @ can be eliminated by choosing 


de = ba{di, ča) 


where 


be = Ei f (1) + é, Oe Bey 2 fë) (5.97) 


The function bz (#1, 2) can be interpreted as a good way to choose the parameter 
update rate d0/di based on £1 and £s. The “control variable” x3 can be chosen 
to give 

dV 

W g-e 
Using bz as an estimate of dO/dt, we now rewrite Eq. (5.96) as 


Oe 1— Etas té tét Se (i+ +67) 


+ 901 pp San (S — b) (5.98) 
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Now define 
ae(&1, 62,6) = 2, +624 F (—€1 + &2) + = be (5.99) 


and introduce the state variable s as 


§3 = Xg + @e(er, Čz, ô) 
The differential equation (5.98) can be written as 


déz _ Gay dé 
a. —O)~Ee+&+ 5 BE; 6f ++ — a (ae - b) (5.100) 
The derivative of 3; becomes 
dé; Aag Baz dëz Bag dé 


Notice that the control variable u now appears explicitly on the right-hand side. 
In the stabilization problem the error is equal to the vector € and the error 
equation is obtained by combining Eqs. (5.95), (5.100), and (5.101). Following 
the general MRAS approach, we now attempt to find a feedback law and a 
parameter adjustment rule that stabilizes the error equation. Choosing 


2V = 64+ ++ 6 


as a possible Lyapunov function, we get, after straightforward but tedious 
calculations, 


ov set — čz + eds +9275 ĝa a (8 — bz) 


day day dô 
+a (u+ 76 (a8, + 76 S) 


+6 (Gut tink fs (Fee oe aE) -F 


061 


The term that contains 6 can be eliminated by updating the parameters in the 
following way: 
0a 


a = Sil + EER f + e(o1,62)&3 (5.102) 


where 


Ena = (ag + ag aes) f 


Furthermore, introducing 
b3(21, 62,63) = be + c&s 
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and 
Oae Gag ( 2 Oa, ) Gaz 
= fo +&3 + =" (—€1 + €2) + = ~ — & —~ b 
as =2+és aE, | $1 + 2) OE, —ő1 — čz + és — $3 O°) + 3G 88 
we find that Jô 
8 
AE — bs = c&3 
The derivative of the Lyapunov function can then be written as 
dV 
a -6f — 83 — $3 + fa(u + as) 
The feedback law 
u = —a3{¢1,$2,$3) (5.103) 
gives J 
V 
V --8-8-8 
and we find that dV /dt is negative as long as |é| # 0. O 
Summary 


The examples given should give some of the flavor of nonlinear adaptive con- 
trol. The results obtained depend on clever changes of coordinates. A reason- 
able characterization of the class of systems in which the methods apply is not 
available. Nevertheless, we can make some interesting observations from the 
examples. First, we can notice that the adaptive control laws that are obtained 
differ significantly from those obtained from the certainty equivalence prin- 
ciple. In the nonlinear approaches the control law and the rule for updating 
the parameters are obtained simultaneously. An estimate of the rate of change 
of the parameters appears in the feedback law. Many problems remain to be 
solved. 


5.11 CONCLUSIONS 


The fundamental ideas behind the MRAS have been covered in this chapter, 
including 

e Gradient methods, 

e Lyapunov and passivity design, and 

« Augmented error. 
In all cases the rule for updating the parameters is of the form 


10 _ ay 
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or, in the normalized form, 


dé, _ PE 
dt l u+gTy 


In the gradient method the vector g is the negative gradient of the error with 
respect to the parameters. Estimation of parameters or approximations may be 
needed to obtain the gradient. In other cases, g is a regression vector, which is 
found by filtering inputs, outputs, and command signals. The quantity £ is the 
augmented error, which also can be interpreted as the prediction error of the 
estimation problem. It is customary to use an augmented error that is linear 
in the parameters. 

The gradient method is flexible and simple to apply to any system struc- 
ture. The calculations required are the determination of the sensitivity deriva- 
tive. Since the sensitivity derivative cannot be obtained for an unknown pro- 
cess, it is necessary to make several approximations. The initial values of the 
parameters must be such that the closed-loop system is stable. Empirical evi- 
dence indicates that the system is stable for small adaptation gains but that 
high gains lead to instability. It is difficult to find the bounds. In Chapter 6 
we give more insight into the properties of the gradient method. 

A general MRAS is derived in Section 5.8 on the basis of the model- 
following design in Chapter 3. This algorithm includes as special cases many 
of the MRAS designs given in the literature. The estimation of the parameters 
can be done in several ways other than those given in Eqs. (5.62) and (5.63). 
Various modifications are discussed in Chapter 6. 


PROBLEMS 


5.1 Consider the process 
1 
G(s) = s(s+a) 
where a is an unknown parameter. Determine a controller that can give 
the closed-loop system 


o? 


Gm (s) = 5? + 2% ws + w? 


Determine model-reference adaptive controllers based on gradient and 
stability theory, respectively. (Compare Problem 3.2.) 


5.2 Consider the simple MRAS in Fig. 5.4 with G = 1/s. Let the parameter 
adjustment law be Eg. (5.57) (i-e., of PI type). Determine the differential 
equation for 0, and discuss how y; and ys influence the convergence rate. 


256 


5.3 


5.4 


5.5 
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Consider a position servo described by 


where parameters a and 6 are unknown. Assume that the control law 
u = y(u: — y) — Ogu 


is used and that it is desired to control the system in such a way that the 
transfer function from command signal to process output is given by 


a 


Gm(s) = 2 + 2f8 + w? 


Determine an adaptive control law that adjusts parameter 8; and 62 so 
that the desired objective is obtained. 
An integrator 
b 

G,(s) = z 

is to be controlled by a zero-order continuous-time controller 
u(t) = —soy(t) + toue(t) 

The desired response model is given by 


Bm 
S + Om 


Grn(s) = 


Derive, using the Lyapunov theory, a parameter update law of an MRAS 
guaranteeing that the error e = y — ym goes to zero. Try the Lyapunov 
function 1 i i 

ve) =} G + Š (bso—an)” + + (bto — bn)*) 


where 
e(t) = y(t) — Ym (£) 


Consider the problem of adaptation of a feedforward gain in Example 5.1 


when 
1 


(s+ 1)(s + 2) 
(a) Introduce the augmented error, and determine an MRAS based on 
stability theory. 


(b) Show that the derived adaptation law in part (a) gives a stable closed- 
loop system. 


G(s} = 


5.6 


5.7 


5.8 


5.9 


5.10 


5.11 
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Determine conditions in which a second-order transfer function 
bos? + bis + be 
G(s) = > 
8* +018 + Ga 
is strictly positive real. 


Show that B{s)/A(s) is SPR if A(s) is a stable polynomial and the B 
polynomial is the first row of the P-matrix defined by the Lyapunov 
equation 

ATP + PA=-Q 


where the matrix A is 


—0; —Og in- On 
I 0 0 0 
A= 
0 0 1 0 


and Q is a symmetric positive definite matrix. Show that the system of 
equations for solving pı, pe, and py in Example 5.6 has a unique solution 
only if all the eigenvalues of A are in the left half-plane. 


Show that the transfer function 

G(s) =l+s 
is SPR and ISP but not OSP. 
Show that the transfer function 


G(s) = 


s+l 
is SPR and OSP but not ISP. 
Show that the transfer function 


G(s) 


is OSP and ISP but not SPR. 
Consider the system 


_ 4) 
~ (6 +1)? 


G(s) = Gi(s)G2(s) 
where 


Gi (s) = 


s+ 
c 


s+ 


Ga (s) = 


a, 
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where a and 6 are unknown parameters and c and d are known. Dis- 
cuss how to make an MRAS based on the gradient approach. (Compare 
Problem 3.3.} Let the desired model be described by 


Q 


Gm (s) = 8 +2č0s + w? 


5.12 A process has the transfer function 


5.13 


5.14 


5.15 


5.16 


5.17 


b 
G(s) = als +1) 


where b is a time-varying parameter, The system is controlled by a pro- 
portional controller 
u(t) = k (ue(t) — y(t)) 


It is desirable to choose the feedback gain so that the closed-loop system 
has the transfer function 


1 
st+st+l 


Design an MRAS that gives the desired result, and investigate the system 
by simulation. (Compare Problem 3.4.) 


The general MRAS procedure in Section 5.8 was derived for known instan- 
taneous gain bọ. If bp is unknown, we may use the following augmented 


error: 
- a ( (bo - be) GA x) + boy’ (0- e)) 


where 6, is the estimate of bp. Discuss how this augmented error can be 
obtained and how it may be used to update the parameters bp and 0. 


G(s) = 


E€ 


Study the parameter adjustment law in Example 5.2. Make a simulation 
program that implements the adaptive system. Repeat the simulation in 
Fig. 5.5. Investigate the behavior of the parameters and the error. Explore 
how the behavior is influenced by the adaptation gain y. 


Repeat the simulation in Problem 5.4 for different types of input signals. 
Change the amplitude and the nature of the signals. Can you find values 
of the adaptation gain that work well for different inputs? 


Consider the system in Example 5.5. Assume that u, is a step that implies 
that ym will be time-varying. Investigate by analysis or simulate the 
stability Limit and compare with the limit obtained in the example, in 
which u, and ym were constant. 


Consider a first-order system with the transfer function 


b 
sta 


G(s} = 


5.18 
5.19 


5.20 


5.21 


5.22 
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where a and 6 are unknown parameters. Assume that the system is 
controlled by the control law 


t = Ott, — O2y 


Compare by simulation the properties of the systems obtained with the 
MIT rule and the one derived from Lyapunov theory. Use the same 
parameter values as in Example 5.2. (Hint: The algorithms are given 
in Examples 5.2 and 5.7. } 


Investigate the properties of the system in Example 5.7 by simulation. 


Investigate through simulation the convergence rate of the parameters 
in Example 5.2 when the control law of Eqs. (5.9) is used. How will the 
parameter adjustment change if an adaptation rule based on stability 
theory is used? For instance, plot the phase plane for the parameters. 


Consider the process 
50 
G(s) = s(s + 4) 
and the criterion 
| ((y — ue)? + pu?) dt 
0 
Let the control law have the form 


u(t) = —soly — ue) 
or + 
SoP t $1 
t) = -2 [y-u, 

u(t) = -IPE yun) 
Determine the controller parameters through explicit minimization of the 
criterion, and let the gradients be obtained from an estimated model of 
the process. (Hint: See Trulsson and Ljung, 1985.) 


Consider the system in Example 5.14. Figure 5.22(c) shows the rapid 
decrease in the error, while the parameters converge much more slowly. 
Explain the slow parameter convergence by analyzing the sensitivity of 
the closed-loop poles with respect to the estimated parameters. 


Consider a system described by 
b 


| C8) = 340 

where a and b are unknown parameters. Find a simple control law that 
can control the plant well, and derive an adaptive algorithm that gives 
good performance. 
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CHAPTER 6 


PROPERTIES OF 
ADAPTIVE SYSTEMS 


6.1 INTRODUCTION 


Some theoretical problems were discussed in earlier chapters in connection 
with description or derivation of specific algorithms. In particular we used 
equilibrium analysis to analyze the self-tuners and stability theory to derive 
some model-reference algorithms. In this chapter we attempt to bring together 
theory of a more general character. The theory has several different goals: 


e To present some mathematical tools that are useful in analysis of adaptive 
systems. 


e To analyze the behavior of adaptive systems in nonideal cases. 
« To give ideas for new algorithms and for improvement of old algorithms. 


In this chapter we focus on the first two issues. The behavior of specific 
algorithms can be understood through analysis of stability, convergence, and 
performance, Stability proofs require certain assumptions. It is also of consider- 
able interest to understand what happens when the assumptions are violated. 
Analysis of performance may give useful msight into performance limits; it is 
helpful to know whether the performance of a particular algorithm is close to 
the theoretical limits. A good theory should also give clues to the construction 
of new algorithms. 

Unfortunately, there is no collection of results that can be called a theory 
of adaptive control in the sense specified. There is instead a scattered body 
of results, which gives only partial results. One reason for this is that the 
behavior of adaptive systems is quite complex because of their time-varying 
and nonlinear character. Readers who are familiar only with linear systems 
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theory, in which most problems can be answered in great detail, should thus 
be warned. 

The closed-loop systems obtained with adaptive control are nonlinear and 
sometimes also stochastic. Such systems are also very difficult to analyze. To 
obtain some insight with a reasonable effort, it is therefore necessary to make 
some simplifications. It is often possible to analyze the equilibrium conditions. 
The local behavior in the neighborhood of the equilibria can also be explored by 
using linearization. The global behavior of the systems can, however, be very 
complex, particularly if the design parameters are chosen badly. 

In Section 6.2 we show that the adaptive control problem has a special 
nonlinear structure that can be exploited in the analysis. We first show that 
very complex, even chaotic, behavior can be observed if the adaptation gain is 
chosen to be too high. 

Section 6.3 presents an analysis of a system with adaptation of a feedfor- 
ward gain. Such systems can be described by linear time-varying systems in 
which the time variation originates from the command signal. The particular 
case of periodic variations can be dealt with by so-called Floquet theory. The 
analysis reveals that very complex behavior can be obtained even in this simple 
case. 

The properties of indirect discrete-time adaptive systems are investigated 
in Section 6.4. In this case it is natural to investigate parameter estimation and 
the control design separately. There is interaction between these problems be- 
cause the identification is done in closed loop and the control design influences 
the signals generated by feedback. The analysis brings out the importance of 
persistency of excitation and the dangers with singularities in the control de- 
sign. A consequence of this is that it is desirable to have as few parameters 
as possible and to have external excitation. In Section 6.5 we make a similar 
analysis of the direct algorithm. One of the conditions required for the proof 
is that the complexity of the model used must be at least as complex as the 
process to be controlled. A characteristic feature of direct adaptive algorithms 
is that the closed-loop behavior can converge to the desired behavior even if 
the parameters do not converge. 

It is reasonable to assume that if the adaptation rate is small, the param- 
eter estimates will change more slowly than the other variables in a system. 
The closed-loop system can then be viewed as having different time scales. 
This has been emphasized in the descriptions of both the self-tuning regulator 
and the model reference adaptive controlier. The analysis can then be simpli- 
fied by considering the slow and fast modes separately. Averaging analysis is 
a good analytical tool for this. A short presentation of this technique is given 
in Section 6.6. A significant advantage of the averaging technique is that it 
makes it possible to reduce the dimensionality of the problem to the number 
of parameters in the algorithm. The averaging method also makes it possible 
to explore the behavior in detail. A drawback of the averaging results is that 
they hold for small adaptation gains but the theory does not give quantitative 
results about smallness. 
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It is a characteristic feature of feedback that a controller can often be 
designed by using a simplified model of a real process. This is one of the 
reasons why automatic control has been so successful in applications, So far, we 
have analyzed the behavior of some adaptive algorithms under the simplifying 
assumption that the structure of the process is the same as the model used to 
design the adaptive controller. Having obtained the tool of averaging, we are 
in a position to investigate the consequences of the simplifying assumptions 
made in the earlier sections, and we can explore how adaptive systems behave 
in the presence of unmodeled dynamics, that is, when the order of the process is 
different from that of the model used to derive the adaptive controller. Analysis 
of several examples in Section 6.7 leads us to various modifications of the 
algorithms that will improve their robustness to unmodeled dynamics. 

In Section 6.8 we show that averaging techniques can be used to analyze 
stochastic self-tuning regulators. The equilibrium points of the algorithms and 
their local behavior can often be obtained without too much effort. In Section 
6.9, different ways are discussed to make the adaptive algorithms robust with 
respect to the assumptions made in the idealized cases. 


6.2 NONLINEAR DYNAMICS 


We have mentioned several times that adaptive systems are inherently non- 
linear. A natural approach to understand the behavior of adaptive systems is 
thus to use tools from the theory of nonlinear dynamical systems. We first 
investigate the structure of adaptive systems. This reveals that they have a 
very special structure. Some tools from dynamical systems are then reviewed 
briefly, and we apply them to a very simple system. This analysis reveals that 
adaptive systems behave in the expected way when the adaptation gain is 
small. However, the behavior can be very complex for large adaptation gains. 
The analysis also indicates the difficulties involved in the approach. We also 
investigate the special case of adaptation of a feedforward gain. In this case 
the problem is simplified significantly because it can be described as a linear 
time-varying system. A reasonably complete analysis can be performed when 
the command signal is periodic. This analysis reveals that the system is well 
behaved for small adaptation gains but that the behavior is quite complex for 
large adaptation gains. 


Structure of Equations Describing Adaptive Systems 


Consider a process controlled by an indirect adaptive controller as shown in 
Fig. 3.1. We will first consider the case in which parameters of a continuous- 
time model are estimated by using a gradient procedure. Assume that the 
system to be controlled is linear. Let 2° denote the controller parameters and v 
the external driving signals. The signal v is typically composed of the command 


266 Chapter 6 Properties of Adaptive Systems 


signal u, and nonmeasurable disturbances acting on the process, With constant 
controller parameters the closed-loop system can be written as 


d = A(V) + B(B)v 
(6.1) 


n= (<) = C(OË + D(B)v 


The state vector € includes the states of the system, the reference model, the 
data filter, and the auxiliary state variables that may have to be introduced 
to calculate the error e and the regression vector ø used in the parameter 
adjustment mechanism. The vector 77 consists of the error and the regression 
vector that are used by the parameter estimator. 

Furthermore, let @ denote the process parameters. A normalized gradient 
scheme for estimating the parameters can be described by 


dô 8, Eje, E) 
dt “a+ p(8,6)* (8,4) 


The control design can be represented by a nonlinear function 3 = xê), which 
maps the estimated parameters into controller parameters. This map becomes 
the identity for direct algorithms. 

For constant 2 the system (6.1) is linear. The solution can then also be 
characterized by the operators G,, and Goy, which relate e and ọ to v. These 
operators depend on the controller parameters 3. Equation (6.2) can then be 
written as 


(6.2) 


dé (Gv) (GeV) 

dt a+ (Gov)! Gov 
The adaptive system is thus described by Eqs. (6.1) and (6.2), which have a 
very special structure. Equation (6.1) is linear in the states and the external 
driving signals, The controller parameters appear in the coefficients of matrices 
A, B, C, and D. Nonlinearities appear in the product ge in Eq. (6.2), in the 
design map 7, and in the functions A(?), B(3), C(8), and D(#) in Eq. (6.1). 
The equations for an adaptive system have a similar form in the discrete-time 
case. For a system with recursive least-squares estimation the equations can 
be written as 


E41) = AED + BOO 
_({e)_ 

nl = [27] -coo + pewa 

A(t + 1) = 6(t) + P(t + De@e(s) 

P(t+1) = PE -PONA + OPONT PO) 


It is useful to try to exploit the special structure of the equations to get a deeper 
understanding of adaptive systems. One special feature is that the state of the 


(6.3) 
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closed-loop system is naturally separated into two parts, ë and 6. Moreover, it 
is reasonable to assume that @ changes more slowly than ¢. 


Analysis 


Let us briefly summarize how a nonlinear system such as Eqs. (6.1) and (6.2) 
or Eqs. (6.3) can be analyzed. It is a comparatively simple task to find the 
equilibrium solutions by solving the algebraic equations 


dé 
— = 
di 
dé 
7 = 9 


for continuous-time systems. For discrete-time systems the equivalent egua- 


tions become 
S(t + 1) =¢() 
6(t + 1) = A(t) 

It may happen that proper equilibria do not exist. In such cases there 
may be integral manifolds where the parameters 6 are constant although 
the state € varies with time. We are then led to averaging analysis, which is 
discussed in depth in Section 6.6. Equilibria having been found, it is natural to 
determine the local behavior by linearizing the equations around the equilibria 
and applying standard linear theory. A complication is that critical cases in 
which the eigenvalues are zero frequently occur. Having determined possible 
equilibria, we can proceed to investigate how the nature of the equilibria 
changes with important parameters of the system. It is of particular interest 
to investigate changes in which the nature of the local equilibria changes 
(bifurcation analysis). When the local properties are investigated, it is natural 
to proceed to find the global properties. There are no general tools for this, and 
we have to resort to simulations and approximations. Phase plane analysis is 
useful for two-dimensional systems. 


Analysis of a Simple Discrete-Time System 


To illustrate how the analysis can be done, we discuss a simple example. 
Consider a discrete-time adaptive controller that is based on estimation of 
the parameter @ in the model 


y(t + 1) = Oy(t) + u(t) (6.4) 
Let the controller be ` 
u(t) = —0(t)y(t) + yo (6.5) 
where 6 is an estimate of @ and yo the setpoint. If the process is indeed 
described by Eq. (6.4) and if the estimate @ is correct, the controller gives a 
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deadbeat response. The parameter is estimated by using a normalized gradient 
algorithm 
y(t) (yl + 1) — 6) 7H) ~ uO) 
a+ y(t) 

where y and œ are parameters. This is equal to Kaczmarz’s projection algo- 
rithm if y = 1 and & = 0. 

To analyze the closed-loop system, we must also have a description of the 
actual process. We assume that this is given by 


6(t +1) = 6(t) +7 (6.6) 


y(t + 1) = 8oy(t) +a + u(Z) (6.7) 


Notice that, because of the presence of the parameter a on the right-hand 
side, this model is different from the model (6.4) used to design the adaptive 
controller. Equations (6.4), (6.5), (6.6), and (6.7) thus describe a very simple 
case of adaptive control of a process with a constant unmodeled disturbance. 
Using Eq. (6.5) to eliminate u in Eqs. (6.6) and (6.7), we find that the closed- 
loop system can be described by the equations 


y(t + 1) = (8o ~ (6) x(t) +a + yo 
IE) ((8o — (2) A +a) (6.8) 
a+ +7(t) 


This is a second-order nonlinear system. To explore the behavior of this system, 
we follow the procedure of nonlinear analysis. 


A(e+1) =) +Y 


Equilibrium Analysis Equations (6.8) have the equilibrium solution 


y= Yo 
6 = Go + 2 
Yo 
Notice that the equilibrium value of the output is always equal to the setpoint 
in spite of the disturbance. This 1s a phenomenon that we have observed before 
in adaptive systems. (Compare with Example 3.5 and Example 5.2.) Unmodeled 
dynamics, however, give a parameter error. 
Linearizing Eqs. (6.8) around the equilibrium equations (6.9), we find that 
the system matrix is 


(6.9) 


— —Yo 
A= yž (6.10} 
— a 1 — y0 
a a 
This matrix has the characteristic polynomial 


Z +az+a 
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Ha+ ye) 
ye 


where 


It follows from the stability criterion for discrete-time systems (Schur-Cohn) 
that the characteristic polynomial has all its roots inside the unit disc if 


ag<1 
s =a +1 >00 
adge+a,+1>0 
Inserting the expressions for a, and ag into these conditions gives 


a 


(iii) y >0 


The equilibrium is stable if parameters a and y are inside the triangular region 
shown in Fig. 6.1. To have a stable equilibrium, it must thus be required 
that the magnitude of the disturbance a is less than the magnitude of the 
command signal yọ. In addition the adaptation gain y should not be too large. 
It is interesting to see the consequences of unmodeled dynamics. If there are 
no unmodeled dynamics (a = 0), then the condition for local stability of the 
equilibrium becomes 
aty 
O<y<2 x 


Stability is thus guaranteed simply by choosing a reasonable value of y. 
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Global Properties We now investigate the global properties when the param- 
eters are chosen in such a way that the equilibrium is stable. To get some 
guidelines for the analysis, we first simulate the system. In Fig. 6.2 we show 
a phase portrait for the case in which @ = 0.1, y = 0.1, @ = 1.5, yo = 1, and 
a = 0.9. It follows from Eqs. (6.9} that the equations have an equilibrium for 
y= 1.0 and 6 = 2.4 and from condition (ii) in Eqs. (6.11) that the equilibrium 
is stable provided that 0 < y < 0.22. The equilibrium is thus stable for the cho- 
sen value of the adaptation gain. Remember that the system is a discrete-time 
system. The discrete solution points are connected with straight lines to give a 
continuous graph. All trajectories shown in the simulation are approaching the 
equilibrium. Solutions with initial values 6(0) = 0 appear to have large excur- 
sions, and the trajectory with (0) = 2.5 seems to be oscillatory. To understand 
the behavior intuitively, we consider the equations for y and @ separately. It 
follows from Eqs. (6.8) that if Ê is constant, then the motion of y is governed 
by 


y(t + 1} = (8o — A) y(t) + a + yo 
This is a first-order difference equation with the equilibrium solution 


a+ yo 


= é) = — 
y= f(@) 146o, 


(6.12) 


-10 0 10 20 


Figure 6.2 Phase portrait for the system in the stable case. Parameter 
values are œ = 0.1, y = 0.1, 6) = 1.5, % = 1, and a = 0.9. The dashed lines 
indicate the interval 6) — 1 < @ < ĝo +1. The dot is the equilibrium point. 
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If parameter Ê is constant, the solution is stable if 
-i< 6 <O+1 


and unstable otherwise. These bounds are shown as dashed lines in Fig. 6.2. If 
the parameter 6 is kept constant, y diverges monotonically at the lower bound 
and diverges in an oscillatory manner with period 2 at the upper bound. In 
reality, parameter 6 will of course change. The smaller the adaptation gain 
is, the smaller rate of change. With the numbers used in the simulation the 
bounds are 0.5 and 2.5. The behavior shown in Fig. 6.2 can thus be explained 
qualitatively. The solution approaches the curve (6.12) and then moves along 
this curve. The variable y appears to grow exponentially for 6 < 0.5; it decays 
exponentially for 0.5 < @ < 1.5 and decays in an oscillatory manner for 
1.5 < Ê < 2.5. The variable grows in an oscillatory manner for 6 > 2.5. 

We now turn our attention to the equation for the parameter estimate. 
Introducing 


A 


6= 6-4 
we find 
6(é+1)= (1 — r) 6(t) + rts (6.13) 


This equation implies that the signals y and cannot be unbounded because 
Eq. (6.13) is always stable when y is sufficiently small. For large values of y(t) 
the added term is small, and the solution will decay. It thus appears as though 
the equilibrium solution that is locally stable may also be globally stable in 
this case. A more precise discussion of this is given in Section 6.5. 


Unstable Local Equilibria We now investigate what happens when parameters 
are such that the local equilibrium is unstable. We first observe that the 
instabilities may occur by violating any of the conditions given in Eqs. (6.11). 
Analyzing how the eigenvalues change with the parameters shows that the 
eigenvalue passes the unit circle with complex values if condition (i) is violated, 
through z = —1 if condition (ii) is violated and through z = 1 if condition (ii) 
is violated. We consider the situation in which the value of the adaptation gain 
is too large. Increasing the gain means that the solution will become unstable 
with period 2. Consider the case in which 65 = 1, œ = 0.1, yp = 1, and a = 0.9. 
The equilibrium is y = 1 and 6 = 1.9. It follows from the stability criterion 
that the equilibrium is stable if y < 0.22, With y = 0.5 the linearized closed- 
loop system is unstable. Figure 6.3 shows a simulation of the system. The 
behavior of the system is typical for the case with unmodeled dynamics. The 
output y and the parameter estimate Ê appear to approach their equilibrium 
values. The equilibrium is unstable and a diverging oscillation with period 2 
appears when y and Ê come sufficiently close to their equilibrium. The variables 
then oscillate with large excursions. When this happens, the modeling error 
becomes less significant, and the process output y and the parameter estimate 
6 approach their equilibrium values. The process then repeats all over again. 
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Figure 6.8 Simulation of a simple adaptive controller with unmodeled dy- 
namics. The equilibrium values of y and @ are indicated by solid straight 
lines. The true parameter value ĝo is indicated by a dashed straight line. 


The phenomenon that has been observed in many adaptive systems is called 
bursting. 

The simulation shown in Fig. 6.3 represents a very complex behavior. 
Although essentially the same phenomenon repeats itself, the solution is not 
periodic. This is seen more clearly if the system is simulated for a longer time. 
Figure 6.4 shows a phase plane when the simulation time is extended to 10,000 
time units. The solution is very irregular. There is, however, some pattern in 
the motion, as is indicated in the figure. For example, the state moves close to 
the curve given by Eq. (6.12) for part of the motion. The behavior shown is in 
fact an example of chaotic behavior. The pattern shown in Fig. 6.4 is called a 
strange attractor. 


Structural Stability 


Structural stability is an important concept in nonlinear dynamics. Intuitively, 
a system is structurally stable if small changes in the equations will not lead 
to drastic changes in the behavior of the system. A necessary condition for 
structural stability in the continuous-time case is that all equilibria are such 
that the linearized equations do not have eigenvalues whose real parts are 
zero. The equilibria are then said to be Ayperbolic. Stability and structural 
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Figure 6.4 Phase plane plot corresponding to the case in Fig. 6.3 when over 
10,000 time units are simulated. 


stability in adaptive systems are closely related to persistency of excitation. 
We illustrate this by two examples. 


EXAMPLE 6.1 Lack of excitation leads to instability 


Consider the model-reference adaptive system shown in Fig. 5.14(b). Assume 
that the input signal is e(t) = e*. The system can then be described by the 


equations 


ze = —e + ku, 
dé 

ad = —yY ele 
du, 

de 


where 6 = 6 — 8o. The equilibrium is e = 6 = u, = 0. Linearization around 
this point gives a linear system with the system matrix. 
-1 0 0 
A=]0 0 0 
0 0 -1 
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This matrix has the eigenvalues —1, 0, and —1, and the system is clearly not 
stable. o 


i EXAMPLE 6.2 Persistency of excitation gives structural stability 


Consider the same system as in Example 6.1, but assume now that the com- 
mand signal is a step, that is, u,(t) = 1. The system is then described by the 
equations 


de = 
g et RO 
dé > ye 
de Y 


The equilibrium is e = 6 = 0. Linearization around this fixed point gives a 
linear system with the system matrix. 


a= (7 8) 


This matrix has the characteristic polynomial 
s4+st+yk 
and the equilibrium is thus stable if yk is positive. o 


Figure 2.10 in Chapter 2, which illustrates a case of identification under 
closed-loop conditions, is a typical example of structural instability. Additional 
examples are given in Section 6.9. 


6.3 ADAPTATION OF A FEEDFORWARD GAIN 


The special case of adaptation of a feedforward gain has been discussed many 
times because of its simplicity. Let us therefore consider the structure of the 
equations in this case too. For the system in Fig. 5.14 we get 


dé 
e = CE ~ Ym (6.14) 


_ fym MIT rule 
= —u, Lyapunov rule 


where A, B, and C are matrices that give a realization of the transfer function 
&G(s). Notice that in this case the matrices A, B, and C, the regression 
vector Ø, and the error e do not depend on the controller parameters explicitly. 
Furthermore, the parameter is updated as 

dé 


a = 1 Pe(S) (6.15) 
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for a gradient scheme. Jf u, is a function of time, then Ym is also a function 
of time, and Eqs. (6.14) and (6.15) are simply time-varying linear differential 
equations. Such equations can have a complex behavior. We illustrate this by 
an example before proceeding. 


EXAMPLE 6.3 Adaptation of a feedforward gain 


In Example 5.1 we derived an adaptation law for adjusting the feedforward gain 
by using the MIT rule. The behavior of the system was illustrated in Fig. 5.3. 
The system is described by 

dy a 

= RO(t)ue(t) — y(t) 
and the parameter adjustment rule is 


+ 


T = —Y Ym (t)e(t) = ~Y Ym (E) (At) — Ym (4) 


Since the signal ym can be computed from the command signal ue, both u, and 
Ym can thus be regarded as known time-varying signals. The adaptive system 
is described by the equation 


di ($) = lo Eo ($) i ied ee) 


The system can thus be described by a time-varying linear differential equation 
of second order. In Fig. 6.5 we show three simulations for the case in which 
G(s) = 1/(s + 1), k = ko = 1, and y = 11. The reference signal is sinusoidal in 
all cases. The frequency is œ = 1 in the first case, œ = 2 in the second, and 
w = 3 in the third. The controller parameter converges to the correct value for 
w= 1 and ø = 3, but it diverges for w = 2. We thus have a situation in which 
the system is stable for one input but unstable for another. The system is stable 
for low frequencies of the input signal. As the frequency increases, it becomes 
unstable. It becomes stable again as the frequency is increased further. This 
pattern repeats itself as the frequency is increased further. o 


Example 6.3 shows that the system has quite a complex behavior that 
cannot be explained by the intuitive argument of the previous section. To 
understand what is happening, we analyze the equations describing the system. 
Equation (6.16) can be written as 


& = A(t)x + B(t) (6.17) 
This is a linear system with time-varying parameters. In the particular case 
in which the input te is periodic and we connect the adaptation when model 
output Ym has also become periodic, the system is also periodic. For such 
systems there is a well-developed theory that can be used to understand the 
behavior of the system. 
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Figure 6.5 Behavior of the controller gain for an MRAS using the MIT rule. 
The input signal is a unit amplitude sinusoidal with frequency (a) 1; (b) 2; 
and (c) 3 rad/s. The system has the transfer function G(s} = 1/(s + 1), the 
parameters are k = ky = 1, and the adaptation gain is y = 11. The dashed 
lines indicate the correct values of the gain. 


Floquet Theory 


To investigate the stability properties of (6.17), we consider the homogeneous 
part, when A(¢) is periodic with period t and continuous for all t. The solution 
is given by 
x(t) = D(t, to)x(to) 

where P(t,fo) satisfies the linear matrix differential equation 

d® 

— =Å . 

Te (4)m (6.18) 
Since A(¢) is periodic with period f, it follows that A(é+t) = A(t). This implies 
that if (t) is a solution, then (+r) is also a solution. Since the two solutions 
to Eq. (6.18) differ only in their initial conditions it follows that 


P(t+7) = OW (6.19) 


where W is a nonsingular constant matrix. Since the matrix ®(¢) is nonsin- 
gular for all £, it follows that W is also nonsingular. By repeated use of this 
equation we find that 

ât + nt) = p(t) W” 
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where t < Tt. We thus obtain the following result. 


THEOREM 6.1 Stability of linear periodic system 

The periodic differential equation (6.17) is stable if and only if all eigenvalues 

of the matrix W have magnitudes less than 1. Oo 
This result is actually all we need for stability analysis. We can, however, 

also obtain a slightly more general result. Notice that we can compute W 

simply by integrating the differential equation over one period with the initial 

condition equal to the identity matrix. 


THEOREM 6.2 Solution of periodic systems 
The solution to the matrix differential equation (6.18) has the form 


D(t) = D (i)e 


where C is a constant matrix and D is periodic with period T. 


Proof: Since the matrix W in Eq. (6.19) is nonsingular, there exists a matrix 
C such that 
W=e* (6.20) 


Introduce the matrix function D(t) defined by 
D(t)= pHi 
Then 
D(t+7) = (t+ CH = Bt) We eA = Dit) 
and the theorem is proven. oO 
Remark. From Eq. (6.20) we see that the differential equation (6.18) is stable 
if the matrix C has all its eigenvalues in the left half-plane, which means that 


the matrix W should have all its eigenvalues inside the unit disc. Stability can 
thus be determined by numerical integration over one period. oO 


We now show how the results can be used to investigate the stability of the 
system in Example 6.3. 


EXAMPLE 6.4 Parametric excitation 


Consider the system in Example 6.3. Let the command signal be u,(t) = 
sin wt. After a transient the model output becomes 


1 
m(£) = ———— sin (wt — arctan (w 
To determine the stability of Eq. (6.16), we compute W by integrating Eq. (6.18) 
over one period, that is, T = 27/w, with the initial condition ®(0) = F. Then 
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from Eq. (6.19) we get W = (7). Choosing @ = 2 and integrating to T = 7 
give l 
0.4873 0.7283 


© = 
(2) 0.2389 0.4967 
with eigenvalues 0.049 and 0.885 for y = 10 and 


p(z) = (05609 0.9960 
a= (o 2642 0.5463 


with eigenvalues 0.041 and 1.067 for y = 11. It can thus be concluded that the 
adaptive system will be stable for y = 10 but unstable for y = 11. 

This calculation can be repeated for many frequencies and many values of 
the adaptation gain to determine the values of @ and y for which the system is 
stable. The result of such a calculation is shown in Fig. 6.6. Notice in particular 
that Fig. 6.6 explains the behavior observed in the numerical experiment in 
Example 6.3, in which the system goes through a region of instability as the 
frequency of the input signal increases. Notice, however, that the system is 
stable for low adaptation gains, o 


Example 6.3 indicates that even very simple adaptive systems can exhibit 
complex behavior. The mechanism of periodic excitation can also give rise to 
instabilities in more complex adaptive systems. The analysis can be made in the 
same way as for the simple example, but the details are much more complicated. 
The behavior is typically associated with periodic excitation and comparatively 
high adaptation gains. The phenomenon illustrated in Fig. 6.5 is an example 


(A of oe a 
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Figure 6.6 Stability region for adjustment of a feedforward gain with the 
MIT rule. 
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of parametric excitation, that is, a system can be made unstable by changing 
its parameters periodically. A classical example is the Mathieu equation: 


Py dy 
IE +a + (8 +y cosot)y=0 


For @ = 0 this equation describes a pendulum whose pivot point is oscillating 
vertically. It is well known that the normal equilibrium, with the pendulum 
hanging down, can be made unstable by a proper choice of the parameters. 


EXAMPLE6.5 Lyapunov redesign 


In Example 6.3 we found that the MIT rule could give instabilities for large 
adaptation gains. Under the strong assumption that the transfer function of the 
process is strictly positive real, however, the control law derived from stability 
theory is stable for all values of the adaptation gain. We illustrate this in the 
simulation in Fig. 6.7, in which the Lyapunov rule is applied to the system in 
Example 6.3. Compare with Fig. 6.5. o 
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Figure 6.7 Behavior of the controller gain for an adaptive system based on 
Lyapunov stability theory when the input signal is a unit amplitude sinusoidal 
with frequency (a) 1; (b) 2; and {c} 3 rad/s. The system has the transfer 
function G(s) = 1/(s+1), the parameters are k = ky = 1, and the adaptation 
gain is y = 11. The dashed lines indicate the correct values of the gain. 
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Summary 


A discrete-time system and the feedforward gain example have been discussed 
in this section. The examples show that adaptive controllers can have rather 
strange properties. The phenomena could be explained by using simple math- 
ematics, but there will be difficulties in the general cases. It is therefore ap- 
propriate to consider some simplified situations in the coming sections. First, 
indirect and direct self-tuning regulators are discussed under idealized as- 
sumptions. Second, the adaptive control problem is divided into two parts with 
different time scales, and averaging techniques are used to analyze properties 
of the closed-loop systems. 


6.4 ANALYSIS OF INDIRECT DISCRETE-TIME SELF-TUNERS 


In this section we analyze the properties of indirect discrete-time selftuners 
of the type illustrated by the block diagram in Fig. 1.19. Since such controllers 
contain a recursive parameter estimator and a control design calculation, it 
is natural to investigate these subsystems separately. Since identification is 
performed in closed loop, there may also be undesirable effects due to interac- 
tion of control and identification. We start by investigating the properties of the 
recursive parameter estimator. Second, the design calculations must be consid- 
ered. It is particularly important to understand when the design calculations 
are poorly conditioned so that small changes in process parameter estimates 
may cause large changes in the controller parameters. 

It would be highly desirable to determine whether the adaptive system can 
track parameters of a time-varying system. This is a very difficult problem, 
and we therefore limit the analysis to the case in which the real system has 
constant parameters. This can be considered as a first test of an adaptive 
algorithm. To carry out the analysis, we also assume that the real system is 
described by models that are compatible with the models used for parameter 
estimation. In this case it makes sense to talk about the “true parameters.” In 
reality, however, we also have to deal with the fact that the models that we use 
are approximations. This is called the nonideal case. This problem is discussed 
later in Section 6.9. 


Properties of Recursive Estimators 


To investigate recursive estimators, it is necessary to make some assump- 
tions on how the data was generated. In this section we make the assump- 
tion that the data is generated by a model having the same structure as the 
model used in the estimation. It is also necessary to specify the nature of 
the disturbances—for example, whether they are deterministic or stochastic. 
We also find that it is important for there to be sufficient excitation and that 
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identification under closed-loop conditions may cause difficulties. 

The deterministic case, in which data is generated from a system that is 
compatible with the model used in the estimator, is particularly simple. In this 
case it is possible to derive general properties of the estimators. 


Projection or Gradient Algorithms 
The properties of the projection or gradient algorithm are now investigated in 
the ideal case in which data is generated by the model 

y(t) = p" (t)6° (6.21) 
We have the following result. 


THEOREM 6.3 Projection algorithm properties 
Let the estimator 


6(t) = Ê(t — 1) + O 
e(t) = y(t) — p7 ()6(t — 1) = g7 (t) (0° — 6(¢ — 1)) 


with a > 0 and 0 < y < 2, be applied to data generated by Eq. (6.21). It then 
follows that 


(6.22) 


(i) |6(¢) — 0°|| < ÊE- 1) ~ °l < êo) -e >i 
.. . e(t) _ 

” mo Jato 

(iii) lim ||6(2) —6(t—k)}=0 for any finite k 


Proof: Introduce 6(£) = Ô(t) — 6° and 
V(t) = F (6) = EH 
It follows that 
e(t) = 97 (1)6° — g7 (f)6(t — 1) = —p" (Ht — 1) 


Subtracting 8° from both sides of the parameter equation in Eqs. (6.22) and 
taking the norm, we get 


yuy — oY DƏ- Delt) p7 Eoee 
Vi —VE~ a =2 atp oo ~ (a + gT Op 
ye (t) 


= OTT oT DOO 
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e Ogle) 
ye Hele l 
< g <0 
a + pT (t)p(t) 
and the inequality follows from a > 0 and 0 < y < 2. Property (i) has thus 
been established. It follows from the preceding equation that 


ye*(k) 
a +97 (k)o(k) 


x(t) = —2 + 


V(t) = V0) +5 x(k) 
k=1 


Hence 


; e*(k 1 
Daris <5 VO-VO) 


Since 0 < V(t) < V (0), it follows that the normalized error 
e(t) 

vat oel) 
is in k, that is, squared summable, and thus property (ii) follows. From 
Eqs. (6.22), 

2 nT 
16) - ôt- yp = 2 Ove) 
(a + 97 (p(t) 


__ Pe) ( ___ a 
a + oF (t)o(t) a + 97 (é)e(t) 
It follows from property (ii) that the right-hand side of the preceding equation 
goes to zero as t > oo if a > 0. Hence 
k 2 
X Âôt—i+1)- ôt- i) 


Ha 
< 2 |6(¢ — i + 1) — 6 — i)? 


where the right-hand side goes to zero as ¢ => co for finite k. O 


lÊ) — ôE — k)? = 


Remark 1. For y = 1 and a = 0 the algorithm reduces to Kaczmarz’s projec- 
tion algorithm. 

Remark 2. Notice that the result does not imply that the estimates 6(t) con- 
verge. 

Remark 3. The function V(t) can be interpreted as a discrete-time Lyapunov 
function. Oo 


Theorem 6.8 is useful because it gives some properties of the estimator 
that are valid no matter how the regressors g(t) are generated. Additional 
conditions are required to guarantee that the estimates converge. The theorem 
will also be useful to prove convergence of the indirect adaptive schemes. 
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Parameter Convergence of Gradient Algorithms 


We now give conditions for the estimates to converge to the true parameter 
values. Notice that to pose such a problem, it is necessary to assume that data 
is generated by a model that is compatible with the model used to formulate 
the estimate. Parameter convergence is closely related to system identification. 
The properties of identifiability and persistency of excitation play an essential 
role. The convergence rate depends on the algorithm used and the amount of 
excitation. We first consider the gradient algorithm, which is simpler than the 
least-squares algorithm, although it converges at a considerably slower rate. A 
typical projection or gradient algorithm is given by Eqs. (6.22), where a > 0 
and 0 < y < 2. The estimation error is given by 


(d = 6() — & = A(t — 1)6(t -—1) (6.23) 
where (OPTA) 
YPP 
A-9) =1- Ty eT OO 


The problem of analyzing convergence rates is thus equivalent to analyzing the 
stability of Eg. (6.23). Notice that 


7 rolto? (£) _ v9" (ipli) 
Ale— Ie) = (7 ater Det) P) = 9) (: “Gag? De) 


The second factor on the right-hand side is a scalar. This implies that the 
vector p(t) is an eigenvector to A(t— 1) with an eigenvalue that is less than 1. 
The eigenvalue is zero for y = 1 and a = 0. The following lemma is useful to 
analyze Eq. (6.23). 


LEMMA 6.1 Stability of a time-varying system 
Consider the time-varying system 


x(t + 1) = A(t)x(t) 

y(t) = CH)x) 
Assume that there exists a symmetric matrix P(t) > 0 such that 
AT (t)P(t + DACH — P(t) = —C7 (9C (t) (6.25) 


Then Eqs. (6.24) are stable. Moreover, if the system is uniformly completely 
observable, that is, if there exist 8) > 0, Bo > 0, and N > 1 such that 


(6.24) 


t+N-1 
0<fiT< YO D(C CD(k) < Bal < o 
k=t 
for all £ and where ®(k, ¢) is the fundamental matrix, then Eqs. (6.24) are also 
exponentially stable. 
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Proof: Introduce the function 
V(t) = x7 (P()x(8) 
Hence 
V(t +1) -— V(t) =x" (HA? HPE + DAHA — x" (HP (Hx) 
= —x" (£)C7 (HC (Axt) < 0 
The function V can be considered a Lyapunov function for a discrete-time 


system. To prove stability for a discrete-time system using Lyapunov theory, 
we have to show that the difference 


AV(t) = V(QE+1)-Vi(t) <0 
and that the matrix P(t) is positive definite. Iterating the system equations N 
steps gives 


t+N-1 


ViE+N)—V(QQ)=— S x" (RCT (R)C(R) x(k) 


kext 


HNI 
= —x7 (t) ( SO DT(k,t)CT(k)C(k)®(h, a) x(t) 


=t 


T Bi 
< —Bix’ ()x(é) < -Tan PE) V (£) 


where Amax(Pt) is the largest eigenvalue of P(t). Hence 


ve+N)< (1-72) VO = Avo 
From Eq. (6.25) it follows that 
P(t) = CT (C(t) + A7 (P(E + DAG) 
= C7 (HCH 
+ AT (t) (C7 (t+ ICE + 1) + A’ (E+ DP(t + 2A + 1)) A(t) 


oo 
=X 7 (k,t)C7 (k)C(k)O(k, t) 
k=t 
t+N—1 
> $O D7 (k,t)C? (k)C(R) G(R, t) > Bil 
k=t 
Thia shows that Amaz{P{t)) > B1 and fs < 1, which implies that V(t) goes to 
zero exponentially. Furthermore, 
t+N-1 


P(t +N) = P(t)— $ 7 (k,t)C7(R)C(R)P(R, t) < Bs P(2) 
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or 
HN-1 


T T p2 
A 2 BPR, CT (KCRC, t) < TT 


P(t) < 


The matrix P(é) is thus bounded from above and below. Since V (£) goes to 
zero exponentially and P(t) is bounded, it follows that the system (6.24) is 
exponentially stable. g 


Applying this lemma to Eq. (6.23), we get the following theorem. 


THEOREM 64 Exponential stability 
The difference equation (Eq. 6.23) is globally exponentially stable if there exist 
positive constants Br, Be, and N such that for all t, 


#+N—1 


0<Ails D> 9(h)97 (k) < pal < oo (6.26) 


k= 


Proof: Choose P = I and 


CW) = Vra + 2 Pe g7 


where the argument £ of 9 is suppressed. A straightforward calculation shows 
that Eq. (6.25) is satisfied, so the system is stable. To prove exponential 
stability, first observe that uniform observability of (A(z), C(k)) is equivalent 
to uniform observability of ((A(k) — B (k)C(k)), C(k)). Choosing 


Y 
BE = CKcee.c-—c--——— 
* Joea t-o 


we find that A(k) — B(k)C(k) = I, and uniform asymptotic stability then 
corresponds to Eq. (6.26). O 


Notice that Eq. (6.26) is closely related to persistent excitation. (Compare 
with Definition 2.1.) It is thus found that exponential convergence of the gra- 
dient algorithm is closely connected to whether the input signal to the system 
is persistently exciting of sufficiently high order. 

It should be pointed out that condition (6.26) is a persistent excitation con- 
dition for the regressors, not the external reference signal for the system. The 
excitation can be provided by the command signals and by the disturbances 
acting on the process. Notice, however, that excitation may be lost by feed- 
back, which can introduce relations between the variables appearing in the 
regression vector. We discuss this later in this section. 
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Recursive Least Squares 


Parameter convergence for recursive least squares is first discussed for the 
simple model (6.21), which is linear in the parameters and for which are no 
disturbances. Let the parameter vector have n elements. The parameters can be 
calculated exactly from n data points, provided that the vectors ø(1),... p(n) 
are linearly independent. The least-squares estimate is given by 


A(n) = (È TO > o(k)y(k) 
k=1 k=1 


n -1 n 
= (È paro) S ee? (t) = 6° (6.27) 
ksl k=i1 


The correct state is obtained in n steps. If the estimate is instead calculated 
by recursive least squares, the following estimate is obtained: 


t A 
A(z) = G t0) + pD TO (>: p(k)y(k) + F~ 60) (6.28) 
k=l k=l 
where §(0) is the initial estimate and P(0) is the initial covariance of the 
estimator. By making P(0) positive definite but arbitrarily large, the result 
from the recursive estimation can be made arbitrarily close to the true value. 
From this analysis we obtain the following result. 


THEOREM 6.5 Property of RLS 


Let the recursive least squares be applied to data generated by Eq. (6.21). Let 
P(0) be positive definite and let 6(0) be bounded. Assume that 


PEI < X eo" (k) 
k=1 


where f(t) goes to infinity. Then the estimate converges to 6°. D 


This discussion shows that in the deterministic case it is possible to ob- 
tain parameter estimators that converge in a finite number of steps. The key 
assumption is that the regressors are linearly independent, so } p(k)g" (k) 
is of full rank. When the parameters are changing, a least-squares estimator, 
in which the covariance matrix P is regularly reset to aJ, is a good imple- 
mentation. This procedure is called covariance resetting. To obtain an estimate 
that reacts rapidly to parameter changes, it is also possible to have several 
estimators in parallel, which are reset sequentially. 

Results similar to Theorem 6.3 can also be established for the least-squares 
algorithm and several of its variants. The key is to replace function V(t) in 
Theorem 6,3 by 

Vit) = F (HPE) 
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and add assumptions that guarantee that the eigenvalues of F stay bounded. 
One way to do this is to use the constant trace algorithm (see Section 11.5). 

So far, only the general model (6.21) has been discussed. The properties 
of estimates of parameters of discrete-time transfer functions will now be 
considered. The uniqueness of the estimates is first explored. For this purpose 
we assume that the data is actually generated by 


A*(q)y(é) = BY(g)u(t) + eft +n) (6.29) 
where A° and B°’ are relatively prime. If e = 0, deg A > deg A?, and deg B > 
deg B°, it follows from Theorem 2.1 that the estimate is not unique because the 


columns of the matrix ® are linearly dependent. Theorem 2.10 gives conditions 
for uniqueness of the least-squares estimate. 


The Stochastic Case 


Consider the model 
y(t) = 9" (O° + eft) 
where {e(t)} is a sequence of independent Gaussian (0,o) random variables. 


The least-squares estimator is given by Eq. (6.28). The covariance of the esti- 
mate for large t is (see Theorem 2.2) 


t =} 

P(t) = 0? (>: TO) 
k=1 

By taking the covariance of the estimate as a measure of the rate of con- 

vergence, it is found that under uniform persistent excitation the matrix P 

converges at the rate 1/ż. This implies that the estimates converge at the rate 


1/vt. 


THEOREM 6.6 Convergence of RLS 


Let the least-squares method for estimating parameters of a transfer func- 
tion be applied to data generated by the model of Eq. (6.29) where {e(t)} is a 
sequence of uncorrelated random variables with zero mean and variance g^. 
Assume that the estimated model has the same structure as the process gen- 
erating the data, that is, the ideal case, Further assume that the input signal 
is persistently exciting of order deg A + deg B + 1. Then 


(i) 6(t) + 6° in the mean square as £— 00 
. Â 0 g? _ 1 T 3 
(ii) var (8—6) ~ — lim > ¢ Gi 


(m 


Remark 1. The estimates do not converge to the true parameters when e(t) 
is correlated with e(s) for t # s. 
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Remark 2. Theorem 6.6 gives the convergence rate for the parameter error 
in the ideal case. More complex behavior can be obtained when the different 
components of the regression vector have different convergence rates (see Ex- 
ample 2.11). oO 


Unmodeled Dynamics 


So far, it has been assumed that the true process is compatible with the model 
used in parameter estimation, It frequently happens that the true process is 
more complex than the estimated model. This is often referred to as unmodeled 
dynamics. The problem is complex, and a careful analysis is lengthy; roughly 
speaking, the parameters will converge to a value that minimizes the least- 
squares criterion: 


T 
1 
V@=F 2 (Ayre) — Buç(t)) (6.30) 
where yf and up are the filtered process input and output, that is, 


yp = Hey 
up = Hype 


and the parameter @ represents the coefficients of the polynomials A and B. 
The minimum exists under certain regularity conditions, and the minimizing 
8 is unique under the condition of persistency of excitation. The minimizing 
value will depend on the data filter Hy and the spectrum of the reference 
signal and the disturbances. 


Identification in Closed Loop 


When discussing parameter estimation in Chapter 2, we observed that identi- 
fiability could be Jost if the input was generated by feedback from the output. 
The reason is that the feedback introduces dependencies in the regression vec- 
tor. (Compare with Example 2.10.) Since this is very important for the behavior 
of direct adaptive controllers, we will investigate the problem in a little more 
detail. In this analysis we will consider what happens when we perform system 
identification to data generated by feedback. Consider a process described by 


Ay(t) = Bu(t) + v(t) (6.31) 


with the controller 
Ru = Tu, —Sy 
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where polynomials R, S, and T have constant parameters. The closed-loop 
system is given by 


BT R 

Y= AR} BS“ AR} BS” 

aa AT oO S, 
AR+BS ©" AR+BS 


With a system identification experiment it is possible to determine the transfer 
functions 


BT AT 
Gı=3R+85 Č" AR1BS 

R S 
Gs=3R+B5 “ARV BS 


that appear in these equations. There are no problems with identifiability if 
the input signal u. is persistently exciting of sufficiently high order because the 
polynomials A and B are then readily determined from G, and Gz. However, 
if the command signal is zero and all excitation comes from the disturbance, 
we can determine only the polynomial 


A, =AR+BS (6.32) 


To achieve identifiability, it must also be required that the signal v be per- 
sistently exciting of sufficiently high order. The question of identifiability of 
polynomials A and B then becomes a problem of uniquely determining A and 
B from Eq. (6.382) when polynomials R and S are known. If Ap and Bo are 
solutions, the genera! solution is 


where Q is an arbitrary polynomial. When the model structure is specified, the 
highest degree of A is also given. The solution is thus unique only if polynomials 
R and S have sufficiently high degree. To achieve identifiability in closed loop, 
it is therefore important that the controller be of sufficiently high order. It is 
natural to assume that R and S have the same degree. Identifiability is then 
obtained if 


deg R = degS > deg A (6.33) 


In Example 3.2, in which deg A = 2, deg B = 1, and deg Fh = deg S = 1, we 
do not have identifiability under closed loop with ue = 0. However, if it is 
required that the controller has integral action as in Example 3.10, we have 
deg R = deg S = 2, and the condition (6.33) holds. To achieve identifiability, it 
must, of course, be required that the disturbance be persistently exciting. 
Also observe that if a pole placement design is used, all models that are 
estimated will give the correct closed-loop characteristic polynomial. 
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Design Calculations 


The design calculations are an important part of indirect adaptive systems. 
Theoretically, the design procedure is represented by the function y, which 
maps process parameters @ to controller parameters &. The properties of x 
will, of course, depend on the parameterization of the model and the design 
procedure chosen. The function can often be quite complicated. It is impor- 
tant that the map gives unique controller parameters and that there are no 
Singularities in the map.We discuss the properties of the map in some simple 
cases. 


Consider the process model 
Ay = Bu (6.34) 


where it is assumed that A has degree n and B has degree n—1. The model thus 
has 2n parameters. If pole placement design is used, the controller parameters 
are given by 


AR + BS = Asm (6.35) 


where F and S have the same degree m as the observer polynomial A,. The 
minimum-degree solution corresponds to m = n—1, but an observer of higher 
order is often preferable to improve the robustness of the system. Without loss 
of generality, £ can be monic. The controller then has 2m + 1 parameters. The 
function y is thus a map from R?” to R?"+!, where m > n—1. Since Eq. (6.35) 
becomes singular when polynomials A and B have a common factor, it follows 
that the map 7 has singularities. The problem with design singularities is 
illustrated by an example. 


_ EXAMPLE 6.6 Singularities for pole placement design 
Consider the model of Eq. (6.34) with 


A(q) = Č + agta 
B(q) = boq + by 


In Example 3.2 a controller was designed for 


Am(q) = @ + mig + ame 
A,(qg)=qt+a, 


In this case the controller and process parameters are 


v= (rı So 81) 0 = (a; ado bo bı) 
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and the map 7: Rt — R? is given by 


p, — Comat + (Az — m2 ~ CoGmi)bobr + (Go + amı — 21) BF 
i b? — ibob + azb? 


2 
S by(QoOm1 — @2 — Oyj 8) + Ay + One — aap) 


S 
i b? — ibob + azb? 
bo(üm102 — @1A2 — Aolm2 + Apt2) 
eI 6.36 
b? — a1ibob1 + azb ( ) 
ges by (e102 — Am102 + Gomz — AoA) 


b? — arbobi + azb 


A bo(a2am2 ~ 0% — Qoam201 + Qo020m1) 
b = aibb + azb? 


The map y is singular when the denominator in Eqs. (6.36) vanishes, that is, 
when 


b? — arbob: + a2b2 = 0 g 


Singularities of the type in Example 6.6 will appear for practically all 
design methods. Since the singularities are algebraic surfaces, the parameter 
estimates must pass them if the algorithms are not initialized properly. There 
are several ways to avoid the difficulties. One possibility is to test for common 
factors and to cancel them if they appear, but such a procedure will require 
test quantities. It will also make y discontinuous, which creates difficulties in 
the analysis. Another and better solution is to find design techniques such that 
the mapping 7 is smooth. This is an open research problem, which so far has 
received little attention. 

The following example illustrates what happens if no precautions are taken 
with cancellations. 


EXAMPLE 6.7 Indirect adaptive system with design singularities 


Consider the system m Example 6.6, and let the controller be an indirect 
adaptive system that is based on estimation of the parameters of the model. 
The desired dynamics A,, are chosen to correspond to a second-order system 
with œ = 1.5 and ¢ = 0.707. The observer polynomial is chosen to be A, = z. 

Figure 6.8 shows the results obtained when the adaptive algorithm is 
applied to a first-order system 


1 
G(s) = ai 
Notice the strange behavior of the output. This would have been even worse if 
the control signal had not been kept bounded in the simulation, The parameter 
estimates converge very quickly to values such that A and B have a common 
factor. The Diophantine equation is then singular, as shown in Example 6.6, 
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Figure 6.8 Simulation of an indirect adaptive pole placement controller 
based on a second-order process model of a first-order process. (a) Output 
and reference value. (b) Control signal. (c) Estimated process parameters. 
(d) Calculated controller parameters. 


and the controller parameters become very large. The consequences of cancel- 
ing a possible common factor and making a design for a first-order system are 
illustrated in Fig. 6.9. In this particular case a factor is canceled if poles and 


zeros are so close that 
A =) b? — arbobi + azb? 
bo bg 


The performance is now very good. C 


< 0.01 (6.37) 


Summary 


Parameter convergence for indirect adaptive algorithms depends critically on 
the assumptions of identifiability and persistency of excitation. Analysis of 
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Figure 6.9 Simulation of an indirect adaptive pole placement controller 
based on a second-order process model. A possible common factor in the esti- 
mated process transfer function is canceled before the control law is calculated 
if the condition of Eq. (6.37) holds. (a) Output and reference value. (b) Control 
signal. (e) Calculated controller parameters. 


the convergence rate of estimators shows that the convergence rate depends 
drastically on the underlying process being deterministic or stochastic. It also 
depends on the algorithm. A least-squares algorithm in the deterministic case 
gives convergence in a finite number of steps, provided that the input is persis- 
tently exciting. The gradient algorithms give exponential but generally much 
slower convergence than the least-squares algorithm. The convergence rate is 
much slower in the stochastic case. Analysis of the convergence rate for estima- 
tors gives only partial insight into the convergence rate of adaptive algorithms. 
To obtain a detailed understanding, it is necessary to consider that the input 
to the system is generated by feedback. 


6.5 STABILITY OF DIRECT DISCRETE-TIME ALGORITHMS 


Stability was discussed in connection with model-reference adaptive system in 
Chapter 5, It was in fact the key design issue in the MRAS. The problem was 
easy to resolve in the cases in which all the state variables were measured and 
for output feedback of systems in which the dynamics were SPR or could easily 
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be made SPR. In these cases the MRAS has the property that arbitrarily large 
adaptation gains can be used. 

A stability proof for a direct discrete-time adaptive contro! law (MRAS or 
STR) for a general linear system will now be given. Some simplifications will 
be made in the algorithm to avoid too many technicalities. 


The Algorithm 


Direct algorithms for adaptive control were discussed in Section 3.5. We give 
the proof for a simple algorithm of this type. Consider a process described by 
the difference equation 


A` (qy) = B* (q u(t —d) (6.38) 


Let the desired response from command signal to process output be character- 
ized by 
An (QT )I(E) = toue (t — d) 


This specification implies that all process zeros are canceled. Furthermore, let 
the observer polynomial be A,. A direct algorithm can then be formulated as 
follows, Estimate parameters of the model 


ALA y(t + d) = Ruft) +8*y(t) = 97 (06 (6.39) 
where 
8 = (ro Fi +. FR Sq Sy... s )f j (6.40) 
p(t) = (u(t) ult—4) ... ulf—-k) y(t) yt-1) ... y—-1)) 


The parameters are estimated by using the following projection estimator: 
, 4 vo(t—d) 
OE) = O(t — 1) + ——— e(t 


e(t) = y(t) — g” (t - d)6(t — 1) 


with 0 < y < 2 and @ > 0. This estimator is the same as Eqs. (6.22) except 
that @ now has index £ —d instead of t. The properties given in Theorem 6.3 
are still valid. 

The control law is 


(6.41) 


R*u(t) + S*y(t) = toAtue(t) (6.42) 
or, equivalently, , 
67 (t) (A7 Amp (t)) = toAgue(t) (6.43) 


where ue(t) is the desired setpoint. Notice that it must be required that Ê; (t) = 
Folt) Æ 0; otherwise, the control law is not causal. 
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Preliminaries 

Since the proof consists of several steps, we outline the basic idea. The prop- 
erties of the estimator were given in Theorem 6.3, which proved that the esti- 
mates are bounded and that a normalized prediction error converges to zero. 
However, the theorem does not show that the estimates converge. By intro- 
ducing the control law and the properties of the system to be controlled, it 
can then be established that the signals are bounded and that the controlled 
output converges to the command signal. 

If the input and output signals of the system can be shown to be bounded, 
then ¢ given by Eqs. (6.40) is bounded. If ø(t—d) is bounded for all £, it follows 
from Property (ii) of Theorem 6.3 that the prediction error e(t} goes to zero. 
The following result is useful to establish the boundedness of ø. 


LEMMA 6.2 Key technical lemma 


Let {s:} be a sequence of real numbers and let {o;} be a sequence of vectors 
such that 


cll < 
ll ell Sex + cz max |s| 


Assume that 5 
St 
z = — 0 6.44 
i &1ı + a207 O ~ ( ) 
and that 
lim s(t) = 0 
i> 


where @; > 0 and a > 0. Then ||o;|| is bounded. 


Proof: The result is trivial if s; is bounded. Hence assume that s; is not 
bounded. Then there exists a subsequence {z,} such that |s; | + oo and |s;| < se, 
for t < tp. For this sequence it follows that 


St | Si 


i] > —___+____ > —. >0 
Oy + agola;,| ~ G1 + (ci + colsi}? T aac? 


where 0 < @3 < Q2. This contradicts Eq. (6.44) and proves the statements. O 


Main Result 


The main result can now be stated as the following theorem. 


THEOREM 6.7 Bowndedness and convergence 

Consider a system described by Eq. (6.38). Let the system be controlled with 
the adaptive control algorithm given by Eqs. (6.40), (6.41), and (6.42) where 
the command signal u, is bounded. Assume that 


Al: The time delay d is known. 
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A2: Upper bounds on the degrees of the polynomials A* and B* are known. 
A3: The polynomial B has all its zeros inside the unit disc. 
A4: The sign of bo = ro is known. 


Then 
(i) The sequences {u(t)} and {y(t)} are bounded. 
(ii) lim | A7, (g~")9(t) — touc (t — d)| = 0 


Proof: Introduce the control error 
E(t) = Aj (Amy (t) — totelt — d)) = P*y(t) — toAgue(t — d) 
= P*y(t) — 67 (t — d) (P*p(t — d)) 
= P*e(t) + P* (07 (t — 1) p(t — d)) — 67 (t — d) (P*p(t — d) 
deg P 
= Pre(t) + X pi (8(t— 1— i) — 0(¢—d))” p(t—d — i) (6.45) 
i=0 
where P = A,A,, has been introduced to simplify the writing. The first two 
equalities are trivial. The third is obtained from Eg. (6.39), the fourth from 
Eqs. (6.41), and the last by expanding the expression. 
It now follows from properties (ii) and (iii) of Theorem 6.3 that 
e(t) 


lim —— = = 0 
mo ya +p (t— d) p(t — d) 


It follows from the first equality in Eq. (6.45) that 


AZA} y(t) = €(#) + toAzue(t) | 
Since the polynomials A, and A,, are stable and since u, is bounded, it follows 
that 
< 
I(E) <2 + Br max le(2) 


Moreover, since the polynomial B is stable, it follows that 


|u(t —d)| < a2 + f2 max lyk) | 
Hence 
l(t — A)| < a + Bs max Je(h)| 


If we apply Lemma 6.2, it follows that p(t) is bounded and that e(t) + 0 as 
t — oo. Since the polynomial Až is stable, property (ii) also follows, m 


Remark 1. We used an algorithm for which the details of the proof are simple- 
With minor modification the results can be extended to cover many of the direct 
algorithms given in Section 3.5. 
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Remark 2. A minor modification of the algorithm is necessary to ensure that 
ĉo # 0. One way to do this is as follows: If fo(t) = 0, modify y to give f(t) Æ 0. 
Theorem 6.3 will still be valid with this modification of the algorithm. Since 
the estimator properties enter into the proof only via Theorem 6.3, the result 
stil] holds. 


Remark 3. Notice that it does not follow that the parameter estimates con- 
verge. The fact that the control error nonetheless goes to zero depends on an 
interplay between the estimation and the control algorithms. This property is 
a special feature of direct algorithms. 

Remark 4. The minimum-phase property is used to conclude that u is bounded 
when y is bounded. 


Remark 5. Notice the similarity between Eq. (6.45) and the augmented error 
introduced in Chapter 5. g 


Discussion 


It has been established that a direct adaptive controller gives a closed-loop 
system with bounded signals and desired asymptotic properties, provided that 
Assumptions Al—A4 are valid. Assumptions Al and A2 are necessary to write 
down the algorithm. Knowledge of the time delay (with a resolution correspond- 
ing to the sampling period) is essential. The signals will not be bounded if d is 
too small. Assumption A3 implies that the sampled system is minimum-phase; 
it is required because all process zeros are canceled in the design procedure. 
The error equation will not be linear in the parameters if this is not done. 
Assumption A4 is essential, since bp is absorbed in the adaptation gain y, to 
guarantee that f(t) Æ 0 for all times, Assumption A2 implies that the adaptive 
control law must have a sufficient number of parameters. This means that the 
model used to design the adaptive controller must be at least as complex as the 
process to be controlled. The consequences of violating the assumptions will be 
discussed later. 


Extensions 


The results can be extended in several different directions. Similar results can 
also be given in the continuous-time case, in which the underlying model can 


be written as 

A(p)y(t) = B(p)u(t) 
where A and B are polynomials in the differential operator p = d/dt. Assump- 
tions Al—A4 are then replaced by the following assumptions: 
Al’: The pole excess deg A — deg B is known. 
A2’: Upper bounds on the degrees of the polynomials A and B are known. 
A3’: The polynomial B has all its zeros in the left half-plane. 
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A4’: The sign of bg is known. 
The results can also be extended to systems with disturbances generated from 
known dynamics. 

The gradient estimation algorithm can be replaced by other, more efficient 
methods. Theorem 6.3 then needs to be generalized. Many types of least- 
squares-like algorithms can be covered by replacing the function V = 676 
in Theorem 6.3 by V = 67 P-'6 and adding assumptions that guarantee that 
the eigenvalues of P stay bounded. Other control laws can also be treated. One 
important situation that has not been treated is the case in which the control 
signal is kept bounded by saturation. Theorem 6.3 still holds, but Theorem 6.7 
does not, since Eq. (6.42) does not hold when the control signal saturates. 


Gronwall-Bellman Lemma 


The essential idea in the proof of Theorem 6.7 is the separation of the adaptive 
controller into two parts. First, some properties of the estimator are established 
that are independent of how the control signa! is generated. Second, properties 
of the controlled system are derived. Convergence and stability are derived on 
the basis of the key technical lemma (Lemma 6.2). This procedure can be used 
for many different adaptive schemes. 

The key technical lemma is a simplified version of the Gronwall-Bellman 
lemma, which is a standard tool for proving the existence of solutions to 
ordinary differential equations. There are both continuous-time and discrete- 
time versions of this lemma. 


LEMMA 63 Gronwall-Bellman lemma: Continuous time 
If u,v > 0, if cy is a positive constant, and if 


ult) <c f ' u({s)v(s) ds (6.46) 
then 
u(t) < cy exp (f u(s) ds) Oo 


LEMMA 64 Gronwall-Bellman lemma: Discrete time 
If u,v > 0, if cy is a positive constant, and if 


i—1 


u(t) <cy > ufk)v(k) (6.47) 
k=0 
then 


u{t) < cı exp (> n) o 


k=0 
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By using the Gronwall-Bellman Jemma, many direct adaptive algorithms 
can be analyzed in the following way: 


« Show that growth conditions such as Eq. (6.46) or Eq. (6.47) hold. 
a Show properties analogous to Eq. (6.44) for the signals u and v. 
e Use the Gronwall-Bellman lemma to get stability. 


These steps can be used as a template for proving convergence and stability 
for adaptive algorithms. 


6.6 AVERAGING 


The results in the previous sections do not permit a detailed investigation 
of adaptive control algorithms. For example, no information about transient 
behavior is available until much more detailed analysis is undertaken. The 
conventional methods for investigating nonlinear systems involve investigation 
of equilibria and analysis of the local behavior near the equilibria. Such an 
approach will give only local properties, although in some special cases it 
may be possible to proceed further and obtain global properties. The results 
of the analysis can then be augmented by simulations. For purposes of this 
discussion it is useful to write the equations of motion of the complete system 
in a comprehensive form such as Eqs. (6.1) and (6.2) or Eqs. (6.3). In an 
adaptive system it is natural to separate the states of the system and the 
process parameters. The process parameters are changing more slowly than 
the states. This separation of time scales is used in the averaging theory to 
gain more insight about the properties of the closed-loop system. The idea of 
averaging originated in the analysis of planetary motion. 


The Averaged Equations 


The analysis of the dynamics of adaptive systems is generally quite compli- 
cated because the complete system is often of high order. Analysis of a direct 
algorithm for a discrete-time second-order system with four unknown param- 
eters using a pradient method leads to a difference equation of order 8 (two 
states of the system, four parameters, and two difference equations to generate 
the regression variables). Ten more equations are obtained if a least-squares 
estimation algorithm is used. 

Because of the special properties of adaptive systems, however, there is 
an approximate method that will simplify the analysis considerably. The basic 
idea is that the parameters change much more slowly than the other variables 
of the system. This property is intrinsic to the adaptive algorithms. If this were 
not the case, we could hardly justify using the notion of parameters. 

To describe the averaging methods, consider the adaptive system described 
by Eqs. (6.1) and (6.2). The rate of change of the parameter Ê can be made 
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arbitrarily small by choosing the adaptation gain y sufficiently small. For 
simplicity we use the simple gradient algorithm 


S = 79(0,8)e(8,6) (6.48) 


The product ge on the right-hand side depends on #4 and č, where 3 = 8(6) 
varies slowly and ¢ varies fast. The key idea in the averaging method is to 
approximate the product pe by 


G(6) = avg {p (3(4), £(8(8), 2) e (0(6), € (8(8), £))} 


where avg{-} denotes the average and ¢ (9(8),£) is computed under the as- 
sumption that the parameters @ are constant. The average can be computed in 
many ways. Typical examples are 


ave {f (6,£(6,t),2)} =z f, f (6,£(6,2),t) dt 
T 
ave {f (6,5(6,t),1)} = lim | f (6,£(6,t),0) dt 


ave {f (6, €(6,t),t)} —Ef (6, €(6,t), t) 


The first alternative is applicable when f is periodic with period T, and the 
last equation applies when € is a stationary stochastic process. Notice that 
the averaged equations can be calculated only when the signals are bounded. 
This implies that the closed-loop system must be stable if the parameters 6 are 
fixed. The calculation of &(29(6), t} is a straightforward exercise in linear system 
analysis. However, the expressions may be complex for high-order systems. 
Symbolic calculation is a useful tool for carrying out the calculations. The 
use of averaging thus results in the following averaged nonlinear differential 
equation for the parameters: 


d — y avg {p (8(8), E(8(8), £)) e (0(8), E28), £))} = 0 (6.49) 


This equation can also be written as 


dé ~ - 
an? avg { (Gv ((8), p)v) (Ga(8(6), p)v) } = 0 (6.50) 
Notice that the transfer functions Gey and Gy, depend on the averaged param- 
eter 6. When the averaged equations are obtained, the behavior of the state 
variables € can be obtained by linear analysis. 

Several averaging theorems give conditions for ĝ being close to Ê. The con- 
ditions typically require smoothness of the functions involved and periodicity 
or near periodicity of the time functions. There are also stochastic averaging 
theorems. Notice that averaging analysis was used in Theorems 4.1 and 4.2. 
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A significant advantage of averaging theory is that it reduces the dimen- 
sions of the problem. The theorems require that the adaptation gain be small, 
but experience has shown that averaging often gives a good approximation, 
even for large adaptation gains. 

When the averaging equations are obtained, analysis proceeds in the con- 
ventional manner by investigation of the equilibria of the averaged equations 
and linearization at the equilibria to determine the local behavior. Notice that 
the averaged equations may possess equilibria ({i.e., solutions to avg{ge} = 0) 
even if the exact equations do not have an equilibrium. This corresponds to the 
case in which the true parameters are meandering in the neighborhood of the 
equilibrium to the averaged equation. 


Sinusoidal Driving Forces 


A simple case of averaging is when the external driving force is sinusoidal, 
that is, v(t) = ug sin wt. The signals g and e are then given by 


Y(t) = Gord, @)v(t) 
e(t) = Ga(d,a)v(t) 


Notice that controller parameters © depend on ĝ. The following result is useful 
for calculation of the averages. 


LEMMA 65 Averaging for sinusoidal input 
Let G, and Gae be stable transfer functions, and let v and w denote the steady- 
state responses of the corresponding systems to the input u, = ug sin wt. The 
mean value of the product vw is then given by 


u2 
avg{vw) = > |G,(t@)| |G, (i@)| cos (arg G, (iw) — arg G,,(i@)) 


2 

= T Re (G,(i@)G,,(—ia)) 
Proof: The signals v and w have the amplitudes |G,(iw)| and |G,,{i@)|; their 
phase angles are arg G, (iœ) and arg G,,(iw). Integrating over one period gives 
the result. o 


A true parameter equilibrium exists if the equation 
Ge (8 (8), o) =0 


has a unique solution. To derive a necessary condition we consider the averaged 
equation 

dð - 

T = y Re {Gyr (8(8),0) RVG (88), —0)} (6.51) 
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where 
Ry = avg (vv7) 


A necessary condition for Eq. (6.51) to have a unique parameter equilibrium 
is that v and 6 have equal dimension and that R, be of full rank. To have 
a unique parameter equilibrium for slow external driving signals, it is thus 
necessary that the number of estimated parameters be less than or equal to 
the number of external driving signals and that the external driving signals be 
persistently exciting. This result indicates that there may be some disadvan- 
tages to overparameterization, contrary to what is indicated in Theorem 6.7. 
The local stability of the equilibrium 8° is given by the linearized equation 


dx 
dt 


where x denotes the deviation from the equilibrium 6 — @° and 


= Ax 


A= Gy, (8(6),0) Ry & GF, (0(0"),0) 


The preceding equations can be applied to slow or constant perturbations by 
setting @ = 0, provided that the assumptions of averaging are fulfilled. 


An Example of Averaging Analysis 


Consider a process with the transfer function &G{s) and an adjustable feed- 
forward gain. Find a feedforward gain 6 such that the input-output behavior 
matches the transfer function kaG,,(s) as well as possible. It is assumed that 
k > 0 and ko > 0. The case Gm = G was discussed in Chapter 5. Two different 
algorithms for updating the gain were proposed in Chapter 5: the MIT rule 
and the SPR rule. The algorithms are 


dO 

u Tre (MIT) 

dô (6.52) 
g 7e (SPR) 


where u, is the command signal, ym = koGmlte is the model output, and e is 
the error defined by 


e(t) = y= Ym = kG(p) (6()u.(t)) ~ koGm{p)te (t) 


The analysis in Section 5.5 shows that the MIT rule gives a closed-loop system 
that is globally stable for any adaptation gain y in the “ideal” case, when 
G = Gm and G is SPR. In the presence of unmodeled dynamics it is, of course, 
highly unrealistic to assume that a transfer function is SPR. So far, no stability 
result has been given for the MIT rule. However, Example 5.5 indicates that the 
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MIT rule will be unstable for sufficiently high adaptation gains if the system 
is not SPR. 

We now investigate the algorithms under nonideal conditions, using aver- 
aging. Inserting the expressions for ym and e into the equations for the param- 
eters, we get 


dé A 

az YY (koGmue) (kG (ue) — koGmuc) = 0 
6 (6.53) 
at ec ({kG(6u,) — koGmtie) = 0 


where the first equation holds for the MIT rule and the second holds for the 
SPR rule. The corresponding averaging equations are 


ad +Y (Ökko ave{(Gmle)(Gu,}} — kê avg{(Gmu.)}?}) -o0 


dð (6.54) 
a +y (Gkavg{u.(Gu.)} — ko avg{uc(Gmtte)}) = 0 
The equilibrium parameters are 
4 ko ave{(Gmuc)?} 
MIT =z avel (G u MGu)} 
k avg{(Gmue)(Gue)} (6.55) 


_ _ ko avg{u,(G,,u,)} 
OSPR = k “ave{ue(Gu,)} 


The equilibrium values correspond to the true parameters for all command 
signals u, only if G = G,, (ie, there are no unmodeled dynamics). When 
G Æ Gm, the equilibrium obtained will depend on the command signal as well 
as on the unmodeled dynamics. Notice that the equilibrium value obtained for 
the MIT rule minimizes the actual mean square error. 

The stability conditions for the averaged equations (Eqs. 6.54) are 


y ave{(G,tu.)(Gu,)} > 0 (MIT) 
y avg{u.(Gu-.)} > 0 (SPR) 


The averaged equation when the MIT rule is used will thus give a stable 
equilibrium for al] command signals if G,, = G. The stability condition depends 
on the command signal and the process dynamics as well as on the response 
model. 

For the SPR rule the stability condition depends only on the command 
signal and on the process dynamics. The equilibrium is stable for all command 
signals if G is SPR. For processes that are not SPR the equilibrium may well 
be unstable. Consider the case of a command signal composed of a constant 
and a sum of sinusoids: 


n 
u(t) = ao + 25a sin @,t 
kat 
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If Lemma 6.5 is used, the stability conditions for y > 0 become 


a2Gy,(0) + 3 a? |Gm (i@z)| |G (i@,)| cos {arg Gm (iw) — arg G{iwr)} > 0 


k=1 


agG(0) + > af ReG(ia,) > 0 


k=1 


For a single sinusoidal command signal the MIT rule gives a stable equilibrium 
if the phase lags of G,, and G differ by at most 90° at the frequencies of the 
input signal. The SPR rule, on the other hand, gives a stable equilibrium if the 
phase lag of the process is at most 90°. 

For command signals containing several sinusoidals the equilibrium can 
still be stable, provided that the command signal is dominated by components 
with frequencies in the range in which the phase lag of the process is less than 
90°. Notice that it helps to filter the command signal so that the signals in 
the frequency range in which the plant has a phase shift of more than 90° are 
attenuated. In the MIT rule, reduction of the gain of the model can also be 
reduced at high frequencies. It follows from Eqs. (6.54) that the convergence 
rate of the parameters is strongly signal-dependent. The value of normalization 
as described in Section 5.3 is that the convergence rate becomes less dependent 
on the signal amplitudes. The preceding calculations are illustrated by an 
example. 


EXAMPLE6.8 Sinusoidal command signal 
Consider a reference model with the transfer function 
a 
s+a 


Gin{s) = 


Assume that the process has the transfer function 


ab 
(s+a)(s +5) 


Furthermore, let the command signal be a sinusoid with unit amplitude and 
frequency @. Equations (6.55) give the equilibrium values 


G(s) = 


Baer P+ o? 
MT k  B 
= _ ko a(b? +o’) 
r= oee Coe 
The stability conditions show that the MIT rule is stable for all w, but the SPR 


rule is stable only if œ < vab. Figure 6.10 shows the estimates of the gain for 
the case behavior a = 1 and b = 10 when the input signals have frequencies 
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1.0 
6 
0.5 
0.0 
0 100 300 500 0 100 300 500 
` Ti 
(d) 
10 
5 M 
8 
0 
0 100 300 500 0 100 300 500 
Time Time 


Figure 6.10 Estimated feedforward gains obtained by the MIT rule with 
sinusoidal input signals having frequencies (a) @ = 3; (b) @ = 3.4; and the 
SPR rule when (c) @ = 3; (d) œ = 3.4, for a system with Gm = 1/(s + 1) 
and G = 10/((s + 1)(s + 10)). The dashed lines are the equilibrium values 
obtained from averaging analysis. 


© = 3 and œ = 3.4. The equilibrium values predicted by the averaging theory 
are also shown in the figure. The SPR is unstable for @ = 3.4 > V10. Also 
notice the drastic difference in the equilibrium values between the different 
updating methods. The desired equilibrium value is 6) = kg /k. 

The behavior is well predicted by the averaging analysis. Notice the differ- 
ence in convergence rates. Initially, when Ë = 0, the rates of changes are given 
by 


Ëy = vki avg{(Gmue)”} 


These expressions clearly show that the initial rates decrease with increasing 
frequency because |G,,(i@)| decreases with frequency. For the SPR rule the 
rate decreases even more because of the phase lag between u, and G,,u,. O 


In conclusion, we find that averaging analysis gives useful insights. It 
shows that analysis of the ideal case can be quite misleading. Even in the simple 
case of adjustment of a feedforward gain, unmodeled dynamics together with 
high-frequency excitation signals may lead to instability of the equilibrium. 
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The equilibrium analysis also makes interesting contributions to the compar- 
ison of the MIT and SPR rules. First, the equilibrium of the MIT rule has a 
good physical interpretation as the parameter that minimizes the mean square 
error. Second, the apparent advantage of the SPR rule that very high adap- 
tation gains can be used vanishes. In practical situations, there are always 
unmodeled dynamics. In the presence of unmodeled dynamics the gain must 
be kept small to maintain stability. 


6.7 APPLICATION OF AVERAGING TECHNIQUES 


In the previous sections, idealized cases were investigated. The convergence and 
stability analysis of self-tuning regulators were based on Assumptions Al—A4 
and the premise that there are no disturbances. In Chapter 5 the stability of 
MRAS was proved under the SPR assumption on certain transfer functions. 
Assumption A2 in Theorem 6.7 implies that the model used to design the 
adaptive controller must be at least as complex as the process to be controlled. 
This is highly unrealistic because real processes are often distributed and also 
nonlinear. 

In practice, adaptive controllers are based on simplified models. It is there- 
fore of interest to investigate what happens when the process is more complex 
than assumed in the design of the controller. In this case the process is said 
to have unmodeled dynamics. If a controller is able to control processes with 
unmodeled dynamics and/or disturbances, we say that the controller is robust. 


Analysis of a Simple MRAS 


A simple model-reference adaptive system for a process of first order was 
derived in Example 5.2 by using the MIT rule. In Example 5.7 the same 
problem was considered, and an MRAS was obtained by using Lyapunov’s 
stability theory. We now use averaging theory to investigate the properties 
of the controller. In designing the adaptive controller it is assumed that the 
nominal transfer function of the process is 


G(s) = 


= (6.56) 


which is not necessarily the true transfer function of the process. The desired 
closed-loop system has the transfer function 

Dm 
Stam 


G,,(3} = 


A model-reference adaptive control law was derived in Example 5.7 by using 
Lyapunov theory. A block diagram of the closed-loop system is given in Fig. 5.11. 
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The system is described by the equations 


dôs yu 
de Mee 
dê; 
ae" 

6.57 
e=¥—I3n ( ) 
y= G(p)u 

Ym = Gm (p)tte 
u = Oyu, — Boy 


where u, is the reference signal, u is the process input, y is the process output, 
Ym is the output of the reference model, e is the error, 6, is the adjustable 
feedforward gain, and 62 is the adjustable feedback gain. 

It is not possible to give a complete analysis of Eqs. (6.57) for general 
reference signals; approximations must be made even in a simple case like 
this. We now investigate the adaptive system when the reference signal is 
sinusoidal, The equilibrium points are first explored, and the behavior in their 
neighborhood is then investigated by averaging and linearization. 


Equilibrium Values for the Parameters 


It follows from Eqs. (6.57) that the parameters 6, and Ê; are constant when 
the error e is zero. The conditions for e to be zero will now be investigated. The 
signal transmission from the command signal u, to the output y is described 
by the transfer function 

_ &G 

~ l+ ôG 


e 
and the control error becomes 
e(t) = y(t) — Ym (t) = (Ge(p) — Gm(p)) uc (t) 
Let the reference signal be u, = up sin wt. The error e is then zero if 
G,(i@) = G,, (iw) 


or 
6°G(ia) = 69G,,(i@)G(ia) + Gn(ie) (6.58) 
This equation can be solved for 6° and Ê} by equating the real and imaginary 


parts. There is a unique solution if Im{G(iw)} # 0. The solutions are eas- 
ily obtained by dividing Eq. (6.58) by G,,G and G, respectively, and taking 
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imaginary parts. This gives 
o __Im{1/G(ia)} 
+ Im{1/G,,G@)} 
po — _1m{Gn(io)/G(ia)} 
Im Gm (ia) 


(6.59) 


In the nominal case we get Ô? = bm /b and 68 = (am ~a)/b. These equilibrium 
values do not depend on the frequency of the command signal. They also 
correspond to the desired feedback gains. 


Averaging 


The command signal u, is the only external signal; hence v = u,. Furthermore, 
gT = (u; y). To obtain the averaging equations, the transfer functions Gey 
and Go, are first calculated: 


AG 
1+ 6G 
êG 

ag = [~ i 

PY 14+ 6G 


By using Lemma 6.5 the averaged equations can now be written as 


ev m 


dë: __ 73 pe] AGO) g 
dt 2 Rel 6G (ia) cata) | 
(6.60) 
dë _ rub p f (_&GGa) _ & Gq) Gio) 
dt 2 1+&G(iw)  ” 1+ 6.G(-ia) 
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Time 


Figure 6.11 Parameter estimates and their approximation by the averaging 
method. The dashed lines show the equilibrium values of the gains. 
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Figure 6.12 System output y (solid line} and the output of the reference 
model ym (dashed line) and error e for Example 6.9 for f=0-20 and £=50-70. 


Notice that these equations are valid also when G is a general transfer function, 
that is, G does not need to satisfy Eq. (6.56). 


| EXAMPLE 6.9 Accuracy of averaging 


Consider the particular case of a = 1, b = 2, and am = bm = 8. Let the 
adaptation gain y be 1, and let the command signal be uwosiné. The time 
histories of the parameter estimates 6,, 62 and their approximations ĝi, 6. 
are shown in Fig. 6.11. The figure shows that the averaging gives a good 
approximation in this case. Notice that the approximation improves with time. 
The process output y and the output of the reference model Ym are shown in 
Fig. 6.12. Notice that the signals are already quite close after 10 s, although 
the parameters are quite far from their correct values at this time. The error 
e€ = y — Ym thus appears to converge much faster than the parameters. This 
was seen for several different adaptive controllers in the previous chapters. 
Also notice that much faster convergence will be obtained with a recursive 
least-squares method. oO 


Local Stability 
The stability of the equilibrium of the averaged equations (Eqs. 6.60) will now 
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be investigated. Straightforward but tedious calculations give the following 
lhnearized equation: 


eer (6.61) 


where x is a vector whose two components are the deviations of 0, and 62 from 
their equilibrium values and the matrix A is given by 


2 
yuzlGn| | — COS Öm |G,,| cos 26,, ) 
A — A 6.62 
26° [Gm] —lGn|* cos 8m on 
where §° is the equilibrium value of 8, and Om = arctan(w/a,,). The matrix A 
has the characteristic equation 


A? + aA(1 + cos” Om) + a sin? 6,, = 0 


where 
ae VUZA pb 
2 (a2 + w?) 


The characteristic equation has its zeros in the left half-plane if œ Æ 0. The 
equilibrium of the linearized equation (Eq. 6.61) is thus stable for all œ # 0. 
The investigated MRAS has been designed by using Lyapunov theory. In the 
idealized case the transfer function (6.56) is SPR, and it is expected that the 
MRAS should have good performance. 


Unmodeled Dynamics 


The consequences of unmodeled dynamics are now investigated for the MRAS 
shown in Fig. 5.11. This system was designed on the basis of the assumption 
that the transfer function of the process has the form (6.56). We now investigate 
what happens if the process actually has a pole excess larger than 1. Before 
we go into details, a specific example is investigated. 


EXAMPLE 610 Unmodeled dynamics 


Assume that the nominal transfer function (6.56) has a = 1 and b = 2 but that 
the actual transfer function is 


458 


G(s) = Gy I + 80s + 229) 


(6.63) 
The dynamics correspond to the nominal plant 2/(s+1) cascaded with 229/(s?+ 
30s + 229). The process thus has two poles s = —15 + 21, which were negiected 
in the model used to design the adaptive controller. Figure 6.13 shows the 
behavior of the controller parameters when the command signal is a step and 
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6, (no noise) 
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8; 
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Time 


Figure 6.13 Controller parameters 6, and 6, when the adaptive control law 
of Eqs. (6.57) is applied to the process of Eq. (6.63). The command signal is 
a step, and there is sinusoidal measurement noise. The smooth curves show 
the behavior when there is no measurement noise. 


there is a sinusoidal measurement error. Figure 6.14 shows the behavior of the 
parameters when the command signal is sinusoidal with different frequencies. 
oO 


Example 6.10 shows that the presence of unmodeled dynamics will drasti- 
cally change the behavior of the adaptive system. Figure 6.14 shows that the 
equilibrium depends on the frequency of the command signal and that it may 
be unstable for certain frequencies, We now attempt to understand the mecha- 
nisms that change the behavior of the system so drastically and to find suitable 
remedies. 


Step Commands 


First, the behavior illustrated in Fig. 6.13 is analyzed. The case of step com- 
mands is first investigated when there is no measurement noise. When œ = 0, 
the equilibrium condition of Eq. (6.58) reduces to 


= 1 -> 1 
62 = = A 


n0 GO) Ge 
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Figure 6.14 Controller parameters 6, and 6, when the adaptive control law 
of Eqs. (6.57) is applied to the process of Eq. (6.63) when the command signal 
is v, = sin wt with (a) æ = 1; (b) œ = 3; (c) œ = 6; (d) œ = 20. 


The equilibrium set is thus a straight line in the parameter space. The line 
is uniquely determined by the steady-state gains G(0) and G,,,(0). Notice in 
particular that the equilibrium set is not a point. This is easily understood from 
the viewpoint of system identification. We wish to determine two parameters, 
6; and 6,. However, the excitation used is a step that is persistently exciting 
of first order and thus admits determination of only one parameter. (See 
Example 2.5.) 

Averaging is now applied to obtain further insight into the behavior of the 
system. The averaging analysis applies to the set of parameter values such that 
the closed-loop system is stable for fixed parameters. To find this set, notice 
that the closed-loop system is a lmear time-invariant system when parameters 
6, and 62 are constant. The closed-loop eigenvalues are the zeros of the equation 


1+6G(s) =0 


A necessary condition for stability is that 1+ 62G(s) has its roots in the left 
half-plane. This condition is also sufficient in the nominal case because the 
transfer function G(s} is then SPR, and arbitrarily large feedback gains can 
be used. When there are unmodeled dynamics, the transfer function G(s) is 
usually not SPR and the closed-loop system typically becomes unstable when 
62 is sufficiently large. 
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. EXAMPLE 6.11 Step commands 


With the transfer function of Eq. (6.63) used in Example 6.10, the closed-loop 
characteristic equation is given by 


(s + 1)(s* + 30s + 229) + 4580, = 0 


or 
s3 + 319” + 259s + 229 + 4580; = 0 


This equation has all roots in the left half-plane if 
—0.5 < 62 < 17.03 = es 


The averaged equations for the parameter estimates are obtained by setting 
œ = 0 in Eqs. (6.60). If it is assumed that G,,(0} = 1, the equations become 


dô, = r _8G(0) — Gm (0) 
dt 2 \14+4G(0) =” 

dö _ yu, _ 8 G(0) aG) Gn (0) 

di 2 1+ẸG((0)\1+%60) 7” 
These differential equations have the equilibrium set of Eq. (6.64). 


Close to the equilibrium set, the equations are described by the following 
linearized equation: 
dx yuz ey 1 5 


ae 36 (4-1 


(6.65) 


(6.66) 


where xı = 6,—69 and x2 = 8,—68. Consider a point away from the equilibrium 
line, that is, xz = xı + ô or 4 = 6, — 1/G (0) + ô. The velocity of the state 
vector at that point is 4 = yu25/@°, te = —yu25/6%. The vector field of 
the linearized equation is thus as shown in Fig. 6.15. The vector field thus 


8z 7 


Stability boundary ~ 


Figure 6.15 Equilibrium set and local behavior of the averaged equations. 
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Figure 6.16 Phase plane of the controller parameters (a) in the nominal 
case of G(s) = 2/(s+1) and (b) in the case of unmodeled dynamics Eq. (6.63). 
The dashed lines are the equilibrium sets of the parameters in the nominal 
case, 


pushes the parameter toward the equilibrium for @, > 0 and away from the 
equilibrium for 6, < 0. Notice that the system is not structurally stable because 
one eigenvalue of the linearized equation is zero. This means that we can expect 
drastically different properties when the system is perturbed. 

It is usually difficult to go beyond the local analysis. However, in this 
particular case it is possible to obtain the global properties of the averaged 
equation. Outside the equilibrium set of Eq. (6.64), the averaged equations 
(Eqs. 6.65) can be divided to give 


dë __ GO) 


dé, 1+ 4,G(0) 
This differential equation has the solution 


Ge + +a 62 + & = const 
The parameters of the averaged equations will thus move along circular paths 
with the center at (0, —1/G(0)). The motion is clockwise for 62 > 6; —1/G(0) 
and counterclockwise for 82 < 6, —1/G(0)}. The motion slows down and stops 
when the parameters reach the equilibrium set 


(ön, G26, >0, 62 = 8; —1/G(0)} 


The averaged equation approximates the nonlinear equations for the param- 
eters only for parameters such that the closed-loop system is stable. In the 
nominal case, when the transfer function of the plant is G(s) = 2/(s + 1), the 
stability region is ~1/G(0) < 6». In the case of unmodeled dynamics the stabil- 
ity region is defined by —1/ G(0) <Ê < ĝztab, This means that trajectories that 
start far away from the origin will escape from the stability region. Figure 6.16 
shows the actual parameter paths in the nominal case and for the unmodeled 
dynamics given by the transfer function of Eq. (6.63) in Example 6.10. With 
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unmodeled dynamics the trajectories will diverge if the initial values are too 
large. The deviation from circular arcs is due to the initial transient when y(t) 
is different from the equilibrium value. The adaptation gain used in the exam- 
ple is quite large (y = 1). The trajectories will be arbitrarily close to circles by 
choosing y sufficiently small. The “jitter” in the trajectories in Fig. 6.16(b) is 
caused by oscillations in the parameters, not numerical errors. D 


The analysis and the simulations show that the adaptive system can be 
unstable if the input signal is a step and if there are unmodeled dynamics. 


Measurement Noise 


We now investigate the effects of measurement noise. The simulation shown in 
Fig. 6.13 indicates that measurement noise may cause the parameters to drift. 
Figure 6.17 shows parameter 62 as a function of parameter 6, with and without 
measurement noise. The simulation indicates that the equilibrium is lost in the 
presence of measurement noise. The parameters move toward a set close to the 
equilibrium set, oscillate rapidly in the neighborhood of this set, and drift along 
the set. The analysis tools developed will now be used to explain the behavior 
of the system. Assume that the command signal is a step with amplitude up 
and that the measurement noise can be modeled as an additive zero mean 
signal n at the process output. It follows from Eqs. (6.57) that the error cannot 


0 0.2 0.4 0.6 0.8 1. 
6; 


Figure 6.17 Phase plane of the controller parameters (a) with and (b) 
without measurement noise. 
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be made identically zero by proper choice of the parameters. Hence no true 
equilibrium exists such that the parameters are constant. The phenomenon is 
a typical behavior of a system that lacks structural stability. Intuitively, the 
results can be explained as follows: A step input is persistently exciting of order 
1 only, which means that it admits consistent estimation of one parameter only. 
When two parameters are adjusted, the equilibrium values of the parameters 
make a submanifold, not a point. Measurement errors and other disturbances 
may cause the parameters to drift along the equilibrium set. In the presence 
of unmodeled dynamics, the feedback gain may then become so large that the 
closed-loop system becomes unstable. By using averaging, the equilibrium set 
and the drift rate along the set can be determined. 

The parameter values will drift also in the nominal case. However, the 
closed-loop system is stable for all parameter values. 


Sinusoidal Command Signals 


Several of the difficulties encountered with step commands are due to the fact 
that a step is persistently exciting of first order only. This means that the 
equilibrium set is a manifold and only a linear combination of the parameters 
can be determined. With a sinusoidal command signal that is persistently 
exciting of second order, two parameters can be determined consistently. It may 
therefore be expected that some of the difficulties will disappear. However, the 
simulation shown in Fig. 6.14 indicates that there are some problems with 
sinusoidal command signals in combination with unmodeled dynamics. 

As before, it is assumed that the adaptive controller is designed as if the 
process were described by the transfer function 


b 
sta 


Since the character of the unmodeled dynamics is important, it is assumed 
that the actual plant is described by the frequency function 


G(s) = 


4) = —ig{@) 
G(ia) = aio r(@)e (6.67) 
The functions r and ¢ represent the distortions of amplitude and phase due to 
unmodeled dynamics. It is assumed that the transfer function corresponding 
tor and ¢ has no poles in the right half-plane. 

The unmodeled dynamics may change the properties of the system dras- 
tically. For example, the nominal system will be stable for all values of the 
feedback gain, since it is SPR. If the unmodeled dynamics are such that the 
additional phase lag can be large, the system with unmodeled dynamics will 
be unstable for sufficiently large feedback gains. The critical gain can be de- 
termined as follows. The phase lag of the plant is ¢(@) +-arctan(@/a). This lag 
is x if o 

© = tan (z —9(«)) = —tan p(o) 
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or 


cos $(@) + asing(w@) = 0 (6.68) 
The process gain of this frequency is 
. brw) 
G(ia)| = ———— 
CON Tart 
The system thus becomes unstable for the gain 
z _ p_ Var+o? 
6. = @& = <m . 
2 2 bro) (6.69) 


where @ is the smallest value that satisfies Eq. (6.68). 


Equilibrium Analysis 


The possible equilibria of the parameters will first be determined. Introducing 
the transfer function of Eq. (6.67) into Eq. (6.59) gives (after straightforward 
but tedious calculations) 


6, = Om . (asin g(a) + aos g(a) 
mS or(a) 
é, = Phen Taos Ao t (ors ane) sin ete) (6.70) 


= ggg ((@sinolo) — a0os9(e)) + ŽE (asin g(a) + o cos p(o) 


A comparison with the nominal case shows that the equilibrium will be shifted 
because of the unmodeled dynamics. The shift in the equilibrium depends on 
the frequency of the input signal as well as on the unmodeled dynamics. 

It is of particular interest to determine whether there are conditions that 
may lead to difficulties. The feedforward gain vanishes for frequencies such 
that Eq. (6.68) is satisfied. This is precisely the frequency at which the process 
has a phase lag of 180°. The feedback gain for this frequency is 


a 1 , _ va* + @* 
Qo = arlaj 229 — a cosg) = ~ br(@) 


This implies that 62|G(iw)| = 1, that is, that the loop gain then becomes unity. 

We thus find that the equilibrium values of the parameters for sinusoidal 
input signals will depend on the unmodeled dynamics and the frequency of the 
sinusoidal command signal. When the frequency is such that the plant has a 
phase shift of 180°, the feedforward gain is zero and the feedback gain is such 
that the closed-loop system is unstable. This observation is illustrated by an 
example. 
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EXAMPLE612 Sinusoidal command signal 
Consider the system in Example 6.10. The transfer function with the unmod- 
eled dynamics is 
458 458 
(s+ 1)(s? + 30s +229) 59 4 81s? + 259s + 229 
The equilibrium values of the controller gains are 


G(s) = 


i = 3(259 — œ?) 
n 458 

-~  2(137+ 7w?) 

M= 229 


when äm = bm = 3. The transfer function G has a phase shift of 180° at 
@ = /259 = 16.09. At this frequency the equilibrium values of the controller 
gains are 6, = 0 and @2 = 3900/229 = 17.03. The closed-loop system is unstable 
for this feedback gain. This explains the results shown in Fig. 6.14. m 


Summary of the MRAS Examples 
The investigation of the first-order MRAS is summarized in the following table: 
Inputs Exact Model Structure Unmodeled Dynamics 


Step command Equilibrium set is a half-line. Equilibrium set is a line 
segment. Stability is lost for 
some initial values. 


Step command + Solution will move toward a Solution will move toward a 
measurement noise line and then drift along the line and drift along the line 
line. until stability is lost. 
Sinusoidal Equilibrium set is a point Equilibrium set is a point 
that is independent of the that depends on the 
frequency. frequency. The equilibrium 
is unstable for sufficiently 
high frequencies. 


Several interesting conclusions can be drawn from the examples. When 
the input signal is not sufficiently exciting, the equilibrium is a manifold 
independently of the presence of unmodeled dynamics or disturbances. When 
there are disturbances, the estimates will drift along the manifold. In the 
case of unmodeled dynamics the closed-loop system may eventually become 
unstable. From a methodological point of view the examples give insights that 
can be derived from equilibrium analysis, which can be carried out with a 
moderate effort in many cases. We can find out if an equilibrium exists in the 
sense that the parameters remain constant. Notice that the averaged equations 
may have an equilibrium even if the exact equations do not. However, it is rarely 
the case that global analysis can be carried out. 
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6.8 AVERAGING IN STOCHASTIC SYSTEMS 


The importance of averaging was illustrated in the previous sections. However, 
the excitation has been restricted to constant or sinusoidal inputs. In this sec- 
tion averaging is used on discrete-time systems with stochastic inputs. Assume 
that the system is described by 

A*(q™)9(t) = B*(q7)u(t — d) + C*(q™)e(t) (6.71) 


where e(t} is a zero-mean Gaussian stochastic process. Depending on the spec- 
ifications, different self-tuning regulators can be used to control the system 
(compare Chapter 4). For simplicity it is assumed that the basic direct self- 
tuning algorithm (Algorithm 4.1) is used. The controller parameters are then 
estimated from a model of the form 
y(t) = R (aul — d) + S*(q™ ot — ad) (6.72) 
or 
y(t) = p(t—d)"@ (6.78) 
The parameters 0 are estimated by using the recursive least-squares method. 
In applying averaging, it is appropriate to use the form 
6() = Ô(t — 1) + (HRE eC — d) (x) — g7 (t—d)ê(t— 1)) 
R(t) = R(t— 1) + y(t) (p — dp" (t-d) - R — 1)) 
where the covariance matrix P(t) is related to R(é) through 


P(t) = 7()R(t)) 


and y(t) = 1/t. In some cases it is convenient to replace the matrix R(t) by 
a scalar r(t). This gives shorter computation times and requires less storage, 
but it gives slower convergence. For stochastic approximation we obtain 


r(t)=r(t—1) + x(t) (g(t — d) o(t—d)— r(t- 1)) (6.75) 


The controller is 


(6.74) 


S(¢") 


MO =~) 


y(t) (6.76) 


or 
g(t)" A(t) = 0 

The self-tuning regulator is described by Eqs. (6.73) and (6.74). The control 
law of Eq. (6.76) is then used on the system of Eq. (6.71). The resulting closed- 
loop system is a set of nonlinear, stochastic difference equations, which can 
be very difficult -to analyze. The difficulty arises mainly from the interplay 
between the estimated parameters as well as the fact that these parameters 
are used in the controller. By using the averaging idea it is possible to derive 
associated deterministic differential equations. The convergence properties of 
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the algorithm can then be determined by using these equations. The method 
was suggested by Ljung in 1977 and is sometimes called the ODE (ordinary 
differential equation) approach. Only a heuristic derivation and motivation are 
given here; further details can be found in the references at the end of this 
chapter. 


A Heuristic Derivation 


For sufficiently large ¢ the step size y(t) in Eqs. (6.74) is small, and the 
correction in 6(t) is small. As in Section 6.6, we can separate the states from 
the parameters and assume that the parameters are constant in evaluating the 
behavior of the closed-loop system. Both R(t) and p(t) depend on the parameter 
estimates. Since Ô is assumed to change slowly, the behavior of the model can 
be approximated by 
y(t) = p" (t—d, €)0 

where @ is the averaged value of the estimates. Also, ọ depends on the es- 
timated variables through the feedback. The updating equation for R can be 
approximated by 


R(t) = R(t — 1) + y(t) (G(8) — R(t — 1)) (6.77) 
where 
GO) = E {p(t 4,29" (t—4,0)} (6.78) 
The expectation is taken with respect to the underlying stochastic process in 
Eg. (6.71) and evaluated for the fixed value of the parameters @. In the same 
way the parameter update is approximated by 


ölt) = A(t — 1) + (DË) (6.79) 
where 
f (8) = E {p(t — 4,8) (y(t) — p” (t — 4, 6)8)} (6.80) 
Equations (6.79) and (6.77) are the averaged difference equations describing 
the estimator. Now let At be a small number, and let t be defined by 


r 
At => y(k) 
k=t 

Then _ . E _ 

5E) = H(t) + ARF (8(0)) 

R(t’) = R(t) + Ar (G (A(£)) — R(z)) 
With a change of time scale such that t = ¢ and ¥ = t+ At, these equations can 
be seen as a difference approximation of the ordinary differential equations 


ae = R(t)’ f (6(z)) (6.81) 


“ = G (@(r)) — R(T) (6.82) 
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If stochastic approximation is used, Eq. (6.82) is replaced by 


dF 


qr = 9 (80) — Fe) 


where 
g8) =E {pP t-d} 

and F is replaced by F in Eq. (6.81). These equations are called the associated 
ordinary differential equations to Eqs. (6.74) and (6.75). They are a special 
kind of averaged equations. First, the difference equations are replaced by dif- 
ferential equations; second, there is a time scaling compared with the original 
system. The time scaling can be interpreted as a logarithmic compression of 
the original time. That is, more and more steps of length y(t) are needed to 
get the step Ar as the time progresses. 

The arguments leading to Eqs. (6.81) and (6.82) have been heuristic. 
However, it can be rigorously shown that, provided that the estimates 6(£) are 
“sufficiently often” in the domain of attraction of the associated differential 
equations, then 

e Only stable stationary points of Eqs. (6.81) and (6.82) are possible conver- 
gence points for the estimates. 
» The trajectories 6(z) are the “asymptotic paths” of the estimates 8(2). 
The associated ODE can be used to find possible convergence points of an 
adaptive algorithm, 6° and R°. The equations can then be linearized around 
these stationary points. It is easily seen that the linearized equations are 


35 v1 OF (8) 5 5 
se) (Cre e) Ce) 


where the element X is not important for the local stability. The stationary 
point is thus stable if the matrix 


_ nra OF) 
K = Gð F _ x (6.83) 


has all its eigenvalues in the left half-plane. The associated ODEs can thus be 
used in the following way: 
1. Compute the expressions for p(t) and e(t) = y(t) — p(t — d) ð for a fixed 
value of @. 
2. Compute the expected values G(@) and f(8). 
3. Determine possible convergence points for Eqs. (6.81) and (6.82), and 
determine the local stability properties by using Eq. (6.83). 
4. Simulate the equations. 
Even if Eqs. (6.81) and (6.82) can be quite difficult to analyze in detail, it 
is usually easy to determine the possible stationary points. The equations can 
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also be simulated to obtain a feel for the behavior of the convergence properties. 
The change in the time scale makes it more favorable to simulate the ODEs 
than the averaged difference equations. 


Stability of Stochastic Self-Tuners 


Averaging methods can be used for stability analysis of stochastic self-tuning 
regulators. Consider a simple self-tuner based on least-squares estimation and 
minimum-variance control (Algorithm 4.1 with Q* = P* = 1). Let the algorithm 
be applied to a system described by Eq. (6.71). The self-tuner is assumed to 
be compatible with the model in the sense that the time delay and the model 
orders are the same. The closed-loop system is globally stable if the pulse 
transfer function i i 

E C{z) 2 


is SPR (see Ljung (1977b)). The filter P* that is used to filter the regressors 
can be interpreted as an estimate of the observer polynomial C*. The condition 
for global stability is then that the transfer function 


is SPR. The local stability condition is that the real part of polynomial C(z)} 
is positive at all zeros of the polynomial B(z) (see Holst (1979)). The method 
with stochastic averaging is illustrated with three examples. 


EXAMPLE 6.13 Stochastic averaging 
Consider the system 


y(t) + ay(t — 1) = u(t — 1) + bu(t — 2) + e(t) + ce(t — 1) 
with a = —0.99, b = 0.5, and c = —0.7. Let the estimated model be 
y(t) = u(t — 1) + riu(t — 2) + soy(t — 1) 
and use the controller 
u(t) = —soy(t) — riu (t — 1) 


The closed-loop system is described by 


Fo sche 2, a 
(1 + aq-!)(1 + rig7!) + sog (1 + bq) 
i= ~s0(1 + 6g") elt) 


(1+ag!)(1 + rig!) + sog-1(1 + bgt) 
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Figure 6.18 Phase plane for the controller parameters in Example 6.13 
when recursive least-squares estimation is used. {a) Trajectories of the as- 
sociated ODE. (b) Realizations of the difference equations. The parameter 
values corresponding to the minimum-variance controller are indicated by a 
dot. 


In this case, 


g“ (t-1)=(u(t-2) y(t—1)) &=(rı so) 
and 
e(t) = y(t) 
Thus 


a= (Za) = (ey zo 


where r,(t), ru(T), and ry,(t) are the covariance functions of y and u and the 
cross-covariance between y and u. 

The stationary point is given by f(@) = 0, which gives r,,(2) = 0 and 
ry(1) = 0. This is exactly the result obtained in Theorem 4.1. Figure 6.18(a) 
shows the phase plane of the ODE when recursive least-squares estimation 
is used. The stationary point corresponds to the minimum-variance controller, 
and the triangle indicates the stability boundary for the closed-loop system. 
Figure 6.18(b) shows realizations of the estimates 8) and 7; when recursive 
least-squares estimation has been used. The estimator is started with a very 
small step size. The realizations agree very well with the trajectories of the 
ODE. The ODEs have been simulated for 0 < t < 50; 75,000 steps had to be 
simulated for the difference equations in Fig. 6.18(b). A forgetting factor of 
A = 0.99995 was necessary to get close to the stationary point. Oo 


EXAMPLE 6.14 Moving-average self-tuner 
Consider an integrator with a time delay t. (Compare Example 4.6.) For the 
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time delay T < A the system is described by 
A(q) = a(¢—1) 
B(g) = (k— rtjg +t = (h — rt) +b) = (h—7)B’ 
C(q) = a(g+0¢) 


where 


and = |c| <1 


The system is minimum-phase, |b| < 1, when t < h/2. Moving-average con- 
trollers of different orders will now be analyzed. (Compare Section 4.2.) 

Case 1 (d=1) 

The mmimum-variance strategy obtained through 


AR + (h —T)B'S = B'C 


giving 
Ra) =q +b 
l+c 
S@)=;,_74 


is the only possibility to get a moving average of order zero. Since the process 
zero 1s canceled, it is necessary for stability that the system be minimum-phase. 
The characteristic equation of K in Eq. (6.83) is in this case 


atn (a+ pp) =0 


Since |b| and |c] are both less than 1, it follows that the eigenvalues of K are 
both negative. 


Case2 (d=2) 
Since B is of first order and C is of second order, there are several possibil- 
ities to get an output that is a moving-average process. We get the following 
combinations: 

Case Bt 

2(a) g+6 Minimum variance 

2(b) q+ Deadbeat 

2(c)} 1 Moving average 


To investigate the equilibria, first notice that Cases 2(a) and 2(b) can give 
stable equilibria only if b < 1 (i.e., 7 < 4/2). 
Case 2(a) corresponds to the minimum-variance controller. The character- 
istic equation of the matrix K is 
bc 


-aferet ip) + x0 
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Figure 6.19 Simulation of the ODEs of the parameter estimates for the 
integrator when d = 2 and c = —0.8. (a) 7 = 0.4. (b) t = 0.6. The parameter 
values corresponding to the moving-average controller are indicated by dots. 


Since b is nonnegative, it follows that this equation has roots in the right half- 
plane or at A = 0 for all c in the interval (—1,1). The equilibrium is thus 
always unstable. 

In Case 2(b) the characteristic equation of the matrix K is given by 


(1+ ¢7)(1 — be)? + c#(1— b?) 
~ e(c — b)(1— be) 


This equation has all roots in the left half-plane if b < c. 
In Case 2(c), moving-average control, the characteristic equation is 


A? + 2A(b —c) + b(b—c) = 0 


Since b is positive, it follows that this equation has its roots in the left half- 
plane if b > c. Notice that the moving-average controller is locally stable for 
b>cevenifh/2 <7 < h, that is, when the controlled process is non-minimum- 
phase. 

Summarizing, we find that if d = 1, there is only one equilibrium, which 
corresponds to the minimum-variance control. This equilibrium is locally stable 
only if 7 < h/2. When d = 2, there are three equilibria, corresponding to Cases 
2(a), 2(b), and 2(c). Equilibrium 2(a) is always unstable; equilibrium 2(b) is 
stable if b < c; and equilibrium 2(c) is stable if b > c. 

The phase portraits of the ODEs associated with the algorithm are shown 
in Fig. 6.19 for the case in which d = 2 and c = —0.8. When 7 = 0.4, there 
are three equilibria. They correspond to Case 2(a), which is a saddle point, 
Case 2(b), which is an unstable focus, and Case 2(c), which is a stable node. 
The stable node corresponds to the moving-average controller. The parameters 
are r, = 0.08 and so = 0.20. For t = 0.6 there is only one equilibrium, which 
corresponds to the moving-average controller with the parameters r; = 0.12 
and so = 0.20. Figure 6.19 also shows that starting points exist for which the 
algorithm does not converge. The estimates are driven toward the stability 
boundary. o 


2 
a4 FE yi0 
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The examples show how it is possible to use the associated ODE both to 
analyze the system and to get a feel for the behavior close to the stationary 
points as well as far away from them. 


EXAMPLE 6.15 Local instability of a minimum-variance STR 
Consider a process described by 
y(t) — 1.6y(t — 1} — 0.75y(¢ — 2) 
= u(t — 1) + u(t — 2) + 0.9u(t — 3) + e(t) + 1.5e(t — 1) + 0.75e(t — 2) 
The B polynomial has zeros at 
ag = —0.50 + 0.81 


Furthermore, 
C(z212) = —0.40 + 0.407 


The real part of C is thus negative at the zeros of B. This implies that 
the parameters corresponding to minimum-variance control make an unstable 
equilibrium for the ODEs. Furthermore, it follows from Theorem 4.2 that these 
parameter values are the only possible equilibrium point for the parameters. 
The following heuristic argument indicates that the estimates are bounded: 


OO | (uae eee Yaka | "bg ie eames a nT RES Ss 
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Figure 6.20 The parameter estimates when a self-tuning controller is used 
on the process in Example 6.15. The dashed lines correspond to the optimal 
minimum-variance controller. 
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If the parameters are such that the closed-loop system is unstable, the inputs 
and the outputs will be so large that they will dominate the stochastic terms in 
the model. The estimates will then quickly approach values that correspond to 
a deadbeat controller for the system, which gives a stable closed-loop system. 
This argument can be made rigorous (see Johansson (1988)). The estimates 
will thus vary in a bounded area without converging to any point. Figure 6.20 
shows the parameter estimates when a direct self-tuning regulator with the 
controller structure 


ŝo +41¢ 4 
u(t) = — a ave) 


1+ fig t+Fe 

is used. The simulation is initialized with values that correspond to the 
minimum-variance controller. The simulation is done by using RLS with a 
forgetting factor 2 = 0.98. Figure 6.20 shows that the estimates try to reach 
the optimal values but are repelled when they get close. Notice also that the be- 
havior is similar to that shown in Fig. 6.3. The example shows that a minimum- 
phase system exists in which the parameters corresponding to the minimum- 
variance controller are not a stable equilibrium for the self-tuning algorithm. 
This particular example led, in fact, to extensive research effort on the stability 
of stochastic self-tuners. o 


6.9 ROBUST ADAPTIVE CONTROLLERS 


In the previous sections we showed that both continuous-time and discrete-time 
adaptive controllers perform well in idealized cases. For the discrete-time self- 
tuning regulator, Assumptions Al—A4 in Theorem 6.7 were necessary to prove 
convergence and stability. The examples indicate that the MRAS algorithm in 
Eqs. (6.57) is incapable of dealing with unmodeled dynamics and disturbances. 
The insight given by the analysis also suggests various improvements of the 
algorithms. In this section, different ways to improve the robustness properties 
are discussed. 

The first and most obvious observation is that the underlying controller 
structure must be appropriate. A pure proportional feedback is not appropri- 
ate, since the controller gain should be reduced at high frequencies to maintain 
robustness. Notice that a digital control law with appropriate prefiltering gives 
a very effective reduction of gain at frequencies higher than the Nyquist fre- 
quency associated with the sampling. However, any use of filtering in this way 
requires prior information about the unmodeled dynamics. 

The examples also show that Theorem 6.7, although it is of significant 
theoretical interest, has limited practical value. The theorem clearly will not 
hold if Assumption A2 is violated. This assumption will not hold in a practical 
case, in which there are always unmodeled dynamics. It is also not realistic 
to neglect disturbances. This raises the possibility that global stability can be 
established only under unrealistic assumptions. 
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Theorem 6.7 also gives poor guidelines for the choice of controller com- 
plexity. To satisfy Assumption A2, it seems logical to increase the controller 
complexity. However, this will impose additional requirements on the input 
signal to maintain persistency of excitation. 


Projections, Leakage, and Dead Zones 


Equilibrium analysis based on averaging shows that the equilibria depend on 
the unmodeled dynamics and the nature of the command signal in a compli- 
cated way. Some general conclusions can be extracted, however. If the command 
signal is not persistently exciting of an order that corresponds to the number 
of updated parameters, the equilibrium set will in general be a manifold rather 
than a point. For systems that are linear in the parameters, the equilibria will 
actually be an affine set, which means that the controller gains may be very 
large on some points of the set. Small amounts of measurement noise or other 
disturbances may then cause a loss of equilibrium and result in drift of the 
parameters. 

Several ideas have been proposed to modify the adaptive algorithms to 
avoid the difficulty. One possibility is to modify the algorithm so that the 
parameters are projected into a given fixed set. However, this requires that 
appropriate prior knowledge be available. For example, in Example 6.11 it is 
sufficient to project into a set such that 0 < Ê < 17. A convenient way to obtain 
a controller with a finite gain is to introduce a path parallel to the process with 
gain p. Let G, be the transfer function of the controller. The arrangement with 
the parallel path is equivalent to use a controller with the transfer function 


G 
r r 
G= 1+pG, 


This is clearly bounded by 1/ when G, has high gain. 

In Section 5.3 we showed that the normalization in the estimator (6.2) 
is important to improve the properties of the algorithms. The normalization 
comes automatically when least-squares methods are used. Another modifica- 
tion is to change the parameter updating in Eq. (6.2) to 


ae =Y ETT + œ (8° — 8} (6.84) 
where @° is an q priori estimate of the parameters and @, > 0 is an appropriate 
constant. The added term œ (0° —ô)}, sometimes called leakage, will make sure 
the estimates are driven toward 0° when they are far from 8. However, the 
modification will change the equilibrium. A priori knowledge is also required 
to choose 8° and @ı. 

To avoid the problem of shift in equilibria, the following modification has 
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also been suggested: 


dé e a 
=Y Tg + a|e|(6° — ô) (6.85) 
A third way to avoid the difficulty is to switch off the parameter estimation if 
the input signal is not appropriate. There are several ways to determine when 
the estimates should be switched off. A simple way is to update only when 
the error is large, that is, to introduce a dead zone in the estimator. Such an 
approach is discussed below. However, it is necessary to have prior knowledge 
to select the dead zone. 

It has also been suggested that the width of the dead zone be varied 
adaptively. From the equilibrium analysis it appears more appropriate to use 
a criterion based on persistent excitation. An alternative to switching off the 
estimate is to introduce intentional perturbation signals so as to ensure a 
proper amount of excitation. 


Filtering and Monitoring of Excitation 


From the system identification point of view the problem of unmodeled dynam- 
ics can be interpreted as follows. In fitting a low-order model to a system with 
complex dynamics, the results depend critically on the frequency content of 
the input signal. Precautions must thus be taken to ensure that the frequency 
content of the input signal is concentrated to the frequency range at which the 
simple model is expected to fit well. This indicates that the signals should be 
filtered before they are entered into the parameter estimator or the parameter 
update law. However, filtering alone is not sufficient, since it may happen that 
the input signal has only frequencies outside the useful frequency range. (A 
typical case is the system in Example 6.12 with u,(é) = sin 16.09¢.) No amount 
of filtering can remedy such a situation. We are then left with only two options: 
to switch off the estimation or to introduce intentional perturbation signals. 


Effects of Disturbances 


Loss of robustness due to disturbances was found for the MRAS in Section 6.6. 
Similar problems can be encounted for discrete-time systems. In Section 6.5 
direct self-tuning regulators were discussed in the ideal case, in which there 
are no disturbances, but the results can be extended in different directions to 
cover disturbances. Consider the case in which the process is described by 


A(q) x(t) = B(q)u) + v(t) (6.86) 


where v is a bounded disturbance. To get some insight into what can happen, 
first consider an example. (See Egardt (1979).) 
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EXAMPLE 6.16 Bounded disturbances 
Consider the system 


y+ 1) + ay(t) = u(t) + VE +1) 


Use an adaptive control law with A? = Af, = 1. (The desired response is thus 
¥m{t + 1) = u,{t).} The control law is 


u(t) = —O(E)y(#) + ue(t) 


where (®) 
z A y 
= —— 1 
6 + 1) = (E) + 1+ PO e(t + 1) 
e(t +1) = y(t + 1) — 6y(8) — u(t} 
Introduce 
=ô- 8” 
where 6° = —a. The closed-loop system can be described by the equations 
p i P y(Ejv(é + 1) 
eit +1) = ———~ &() + = 
C+D = Tee Ot TE Aw (6.87) 


y(t + 1) = —O(£)y(8) + ue (£) + v(t + 1) 


To show that y(t) may be unbounded, we want to construct a disturbance v and 
a command signal u, such that the parameter error goes to infinity. Assume 
that initial conditions are chosen such that 6(1) = 0 and (1) = 1. Define 


ft) Ê (Veee—1) - 0) (1+) t= 2,3,...,T—5 


for some large T. Choose the following disturbance: 
1 
vt—1 


and the following command signal: 


v(t) =1- + f(t) t= 2,8,...,2—5 


MESS EF) ET 


vt 
The signals v and u, are bounded. A straightforward calculation gives 
6(t) = vt—1 
1 
t) = — 
y(t) Fi 


for i= 1,..., T — 5. Further, let 
v(t) =0 t=T-4,...,T 


0 t=T-4 
u(t 1) = 49 t=T—3,...,T 
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It can then be verified that 9(#) and y(t) for large T are approximately given 
by the following table. 


Now choose v(T + 1) and u,(T) such that 6(7 + 1) = 0 and y(T +1) =1. 
The state vector of Eqs. (6.87) is then equal to the initial state. By repeating 


the procedure for increasing values of T, a subsequence of y(t} will increase as 
—T /2 and therefore is unbounded. o 


Example 6.16 shows that the algorithm may behave badly even if it is as- 
sumed that the disturbances are bounded. Robustness against bounded distur- 


bances can be obtained by using conditional updating as shown in the following 
theorem. 


THEOREM 6.8 Conditional updating 
Consider the plant (6.86) where v is a disturbance that is bounded by 


sup 
t 


<^ 


R 
A,AnB 
where R is the polynomial in the feedback law and CG, is a constant. Assume 
that the direct adaptive algorithm defined by Eqs. (6.41) and (6.42) is used, 
with the modification that the parameters are updated only when the estima- 
tion error is such that 

E orn 

— 2 — max (bo /ro,1) 
Let Assumptions A1—A3 hold, and assume in addition that 0 < bo < 2ro. Then 
the inputs and outputs of the closed-loop system are bounded. (a 


Proofs of this theorem can be found in Egardt (1979) and Goodwin and Sin 
(1984). The modification of the algorithm is referred to as conditional updating 
or introduction of a dead zone in the estimator. 


332 Chapter 6 Properties of Adaptive Systems 


Of course, the result is of limited practical value because it requires an 
upper bound on the disturbance, which is not known a priori. The bound also 
depends on the ratio bp/ro = bo [bo where bo is the instantaneous gain. The 
estimate of this gain is thus essential. If bo/To = 1 and A, = Am = 1, it follows 
that R = B, and the condition for updating becomes 


le(t)| > 2 sup |e(£)| 


This means that the estimate will be updated when the estimation error is 
twice as large as the maximum noise amplitude. 

Another modification of the algorithm also leads to bounded signals. The 
modification consists of using the updating law of Eqs. (6.41) if the magnitude 
of the estimates is less than a given bound and to project into a bounded set 
if Eqs. (6.41) give estimates outside the bounds. We refer to Theorem 4.4 of 
Egardt (1979) for details. This method will, of course, require that the bounds 
on the parameters be known a priori. 


Signal Normalization 


Various modifications of the adaptive algorithm are discussed in more detail in 
Chapter 11. Therefore only a few sketchy remarks are given here. Notice that 
Theorem 6.8 gives stability conditions for adaptive control applied to the model 
(6.86), when v is a bounded disturbance. Unmodeled dynamics can, of course, 
be modeled by Eq. (6.86), but v will no longer be bounded, since it depends on 
the inputs and outputs. By introducing the signal defined by 


Cr(é) = max (|u(Z)|, |y(@®))) 
where C is a stable filter, and introducing the normalized signals 


- » ` u _ v 
y=; i=, v= 
r T Y 


the model of Eq. (6.86) can be replaced by 
Ağ = Bü +0 


where @ is now bounded. By invoking Theorem 6.8, it can be established that 
adaptive control with a dead zone or projection gives a system with bounded 
signals. The detailed justification is complicated. 


The Minimum-Phase Assumption 


In Theorem 6.7 and for the MRAS the process is required to be minimum-phase. 
This assumption is used to conclude that the input signal is bounded when the 
output is bounded. The minimum-variance controller, which cancels the open- 
loop process zeros, cannot be used when the process is nonminimum-phase. 
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Instead, the LQG self-tuner or the moving-average controller with increased 
prediction horizon can be used. 

It should be remarked that sampled data systems often can be non- 
minimum-phase because of “sampling zeros” even if the continuous-time sys- 
tem that is sampled is minimum-phase, These zeros are given by the following 
theorem. 


THEOREM 6.9 Limiting sampled-data zeros 
Let G(s) be a rational function 


(s — 21)(s — 29),..-, (8 — Zm) 
G(s) = K aa 
(8) = Fe pa) (= pah,- (6 Pr) 
and let H (z) be the corresponding pulse transfer function. Assume that m < n. 
As the sampling period h — 0, m zeros of H go to 1 as exp(z;h), and the 


remaining n — m — 1 zeros of H go to the zeros of B,_,(z), where B,(z) is the 
polynomial 


(6.88) 


By (2) = bez*) + BRA? 4... 4 BF (6.89) 
and . 
bÈ = Deyn ( a i=1,...,k (6.90) 
The first polynomials B}, are 
By(z) = 1 
Bo(z}) =z+1 
Ba(z) = 27 +4z+1 o 


This theorem is proved in Åström et al. (1984). It implies that direct 
methods for adaptive control that require that the plant be minimum-phase 
cannot be used with too short a sampling period. When very fast sampling is 
required, a continuous-time representation may then be preferable. Another 
possibility is to describe the system in the delta operator, defined by 


or in Tustin’s operator: 

_i1gq-il 

hqll 
This yields parameterizations that give a much better resolution at g = 1. The 
ô operator gives a description that is equivalent to the g operator description. 
The advantage of the transformation is that the 6 operator description has 
better numerical properties when the sampling is fast. All the poles of the g 
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operator form are clustered around the point g = 1. This gives rise to numerical 
sensitivity. For the 6 operator it can be shown that the limiting value 


mn BH) _ Bol) 
raid Ay(B) ` Ao(5) 


is such that the coefficients in By and Ap are the same as the coefficients 
in the continuous-time transfer function. This implies that the structure of 
the transfer function in the 6 operator is essentially the same as that of 
the continuous-time transfer function, provided that the sampling period is 
sufficiently short. 


The High-Frequency Gain 


For a process that has no right half-plane zeros, the standard direct discrete- 
time algorithm is based on the model 


A Anylt + d) = bo (R*u(t) + S*y(4)) 


where bp is the coefficient of the first nonvanishing term in the B polynomial. 
With some abuse of language this coefficient is called the high-frequency gain 
because it is the first nonvanishing coefficient of the impulse response. For 
continuous-time systems the transfer function of the process is approximately 
G(s) = bos-**. In Theorem 6.7 it was required that the sign of the coefficient 
bo be known. There are several ways to deal with the parameter bp. It may be 
absorbed into R and S and estimated. The polynomial R then has the form 


R(q) = rog G1 pe try 


The problem with this approach is that some safeguards must be taken to avoid 
the estimate rp becoming too small. Another possibility is to introduce a crude 
fixed estimate of bọ. The following analysis shows what happens when this is 
done. Let the true system be 


y(t + 1) = bo (u(t) + y” (2) 8°) 
and let the model be 
y(t + 1) = ro (u(t) +y” (6) = rou(é) + o” (HE 
With zero command signal the control law becomes 
u(t) = —y" (ÊE) 
The equation for parameter updating is 


6(t + 1) = O(t) + P(t + Djo(telt + 1) 
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where 
e(t +1) = y(t +1) = bou(t) + boy” (t)0° 
a b » 
= —boy" (t) (ĉ(£) — 6°) = =r g7 (t) (A(t) — 6°) 
The estimation error is thus governed by 


- b oo 
Ət + 1)= (1 — — P(é+ N99") (t) 
0 
With a pure projection algorithm we have 
— 1 
p7 (tole) 
In this case the matrix in large parentheses has one eigenvalue (1 — bo/ro}) 


and the remaining eigenvalues 1. With least-squares updating, the averaged 
equation for @ becomes 


6 +1)= (2 — = 6(#) 


r 
Hence, to remain stable, it must be required that 
0< bo <2 
ro 
If an algorithm with a fixed ro is used, it is convenient to absorb ro in the 
scaling of the signals. This is discussed in more detail in Chapter 11. When 
the parameter bo is estimated, it can be treated like the other parameters. 


However, because of the special structure of the model it is useful to use special 
algorithms such as the ones discussed in Section 5.8. 


Universal Stabilizers 


An interesting class of adaptive algorithms was discovered during attempts to 
investigate whether Assumption A3 is necessary. The following question was 
posed. Consider the scalar system 


d 
= = ay + bu (6.91) 
where a and b are constants. Does there exist a feedback law of the form 
u = f(ô, y) 


dô (6.92) 
that stabilizes the system for all values of a and b? Morse (1983) suggested 


that there are no rational f and g that solve the problem. Morse’s conjecture 
was verified by Nussbaum (1983), who proved the following result, 
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THEOREM 6.10 Universal stabilizer 

The control law of Eqs. (6.92), with 
f(6,y) = yê” cos 6 
9(6,9)=9 

and 6(0) = 0, stabilizes Eq. (6.91). 

Proof: The closed-loop system is described by 


(6.93) 


dy a a 
de = ay + by6" cos 6 


dé 
Toy 


Since 6(0) = = 0 and dĝ/dt > 0, it follows that Ê(t) is nonnegative and nonde- 
creasing. ô(t) i is also bounded, which is shown by contradiction. Hence assume 
that im; 6(¢) = oo. Multiplication of the differential equation for y by y 
gives 

F; dê 

y® = ay? + by? cos =a Ê + bE cos ô 


Integration with respect to time gives 


` êle) 
F(t) = (0) + 2aô(t) + 2b | x? cosx dx 


Hence io 
È 
KAO] = FO) 15,42 x? cos x dx 
A(t) (2) 8(£) Jo 
But > 
1 f° AB a 2. A 
af x? cos x dx = 6sin6 + 2cos Ô — = sind 
6 Jo 8 
Hence 
1 fi 
lim inf | x*cosxdx =— 
i200 g 0 
This gives 
of O 
=o A(t) 


which is a contradiction because y?/6 is nonnegative. It thus follows that 
lim 6(t) = 0 < œ 
Integration of the equation for Ô gives 


60 = [roa 
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Figure 6.21 Simulation of the control law of Eqs. (6.94) applied to the 
plants (a} G(s} = 1/(1 — s} and {b} G{s) = 1/(s— 1). 


It then follows that 
lim y(ż) = 0 a 


The behavior of a universal stabilizer is illustrated in Fig. 6.21. A reference 
value is used in the simulations, and the control law is then modified to 


f(6,¥) = (uc ~ 6 cos 6 
9(8,y) = (ue — yy? 


Notice that the control law of Eqs. (6. 94) can be interpreted as proportional 
feedback with the gain k = ĝ?cosĝ. The behavior of the control law can be 
interpreted as follows. Sweep over all possible controller gains and stop when 
a stabilizing gain has been found. The function g can be interpreted as the 
rate of change of the gain sweep. The rate is large for large errors and small 
for small errors. The form cos Ê makes sure that the gains can be both positive 
and negative. Universal stabilizers may show very violent behavior. This not 
surprising, since the system may be temporarily unstable during the sweep 
over the gains. 

The control law of Eqs. (6.94) is useful because it does not contain any 
parameters that relate to the system that it stabilizes. It is therefore called 
a universal stabilizer. However, the control law is restricted to a first-order 
system. In attempting to generalize Theorem 6.10 to higher-order systems, the 


(6.94) 
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following question was posed. How much prior information about an unknown 
system is required to stabilize it? This question was answered in a general 
setting by Mårtensson (1985), who showed that it is sufficient to know the 
order of a stabilizing fixed-gain controller. If a transfer function is given, it 
is unfortunately a nontrivial task to find the minimal order of a stabilizing 
controller. 


6.10 CONCLUSIONS 


Analysis of adaptive systems is difficult because they are complicated. A num- 
ber of different methods have been used to gain insight into the behavior of 
adaptive systems. The theory is useful to show fundamental limitations of the 
algorithms and to point out possible ways to improve them. 

In this chapter a basic stability theorem has been derived on the basis 
of standard tools of the theory of difference equations. To show stability and 
convergence, it is necessary to make quite restrictive assumptions about the 
system to be controlled. The consequences of violating these assumptions have 
been analyzed. 

It has been shown that analysis of equilibria and local properties around 
equilibria can be explored by the method of averaging. This method can also 
be applied to investigate global properties. Averaging can be applied in many 
different situations. For deterministic problems it can be used for steps and 
periodic signals. It can also be applied to stochastic signals. Averaging methods 
have also been applied to analyze what happens when adaptive systems are 
designed on the basis of simplified models. To apply averaging methods, it 
is necessary to use small adaptation gains. Unfortunately, there are no good 
methods to determine analytically how small the gains should be. It is also 
demonstrated that adaptive systems may have very complex behavior for large 
adaptation gains. Mechanisms that may lead to instability have been discussed. 
One mechanism is associated with lack of a parameter equilibrium or local 
instability of the equilibrium. Other mechanisms are parametric excitation and 
high adaptation gain. The last two mechanisms can be avoided by choosing a 
small adaptation gain. 


PROBLEMS 


6.1 Consider the indirect continuous-time self-tuning controller in Exam- 
ple 3.6. Collect all equations that describe the self-tuner, and show that 
they can be written in the form 


£ 
F = AW) + Bue 


6.2 


6.3 


6.4 
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p 
2 = x(6) 
do 
Tt = Pope 
dP T 
T = &P — Pog’ P 


Give explicit expressions for al] components of the vectors č, g, 2, and 8 
and the matrix P. 


Consider a system with unknown gain whose transfer function is SPR. 
Show that a closed-loop system that is insensitive to variations in the gain 
is easily obtained by applying proportional feedback. Carry out a detailed 
analysis for the case in which the transfer function is G(s) = 1/(s + 1). 


Consider an MRAS for adjustment of a feedforward gain. Assume that 
the system is designed on the basis of the assumption that the process 


dynamics are 
a 


sta 


(a) Investigate the behavior of the systems obtained with the SPR and 
MIT rules when the real system has the transfer function 


G(s) = 


ab? 
G(s) = —————— 
6) = Gyaje + bp 
Determine in particular which frequency ranges give stable adapta- 
tion rules for sinusoidal command signals. 


(b) Consider the MRAS based on the SPR rule when the reference signal 
is constant and when an additional constant load disturbance is acting 
on the input of the process. Investigate how the load disturbance 
influences the stationary point of the total system. Investigate the 
local stability properties through linearization. 


Consider an MRAS for adjustment of a feedforward gain based on the 
MIT rule. Let the command signal be 


Ue = Q1 80 Myf + a2 81n Wet 
and assume that the process has the transfer function 


1 
(s + 1} 


Derive conditions for the closed-loop system to be stable. 


G(s) = 


Consider Theorem 6.7. Generalize the results to cover the case in which 
the polynomial B* has isolated zeros on the unit circle. 
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6.7 
6.8 


6.9 


6.10 
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Consider the system described by 


y(t) = u(t —d) 


Assume that a direct adaptive control (e.g., with A? = A*, = 1) is designed 
according to the assumption that d = 1. Investigate how this controller 
behaves when applied to a system with d = 2. 


Construct a proof analogous to Theorem 6.7 for continuous-time systems. 


Consider the system 
y(t) = u(t -—N+a 


where a is an unknown constant. Construct an adaptive control law that 
makes y follow a command t, asymptotically. Prove that it converges. 


Consider the continuous-time model 
y(t) = p° (t)6 
Let the parameter 6 be estimated by 


dé L e(t) 
dt ' a +o pH ele) 


where y > 0 and @ > 0 are real constants and 
e(t) = y(t) — p7 (6) 
Assume that y(t) is given by 
y(t) = 97 (280 
Prove that 
\6(t) — 8o] < |6(s) — @o| < |6(0) — Ao} t>s>0 


and that 
ll, 
Va +97 (t)o(t) 
as f —> ©. 


Consider the system in Example 6.12. Interpret the results as if the 
adaptive algorithm tried to estimate parameters a and b in the transfer 
function G(s) = b/(s + a). Use Eqs. (6.70) to show that 


_ 229-310” 
259 — w? 
$= 458 
259 — œ? 
Determine the parameters for @ = 2.72 and œ = 17.03. Explain the 
results by evaluating G(s) for the corresponding frequencies. 
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Figure 6.22 Adaptive feedforward controller in Problem 6.11. 


6.11 A feedforward gain is adapted as shown in the block diagram in Fig. 6.22. 
The model is given by 


dy, 
ar =—Jm + Ue 
The process is not linear, however, but is given by 
dy 


ai =~y~ay +u 


Let y= 1 and u, = 1. 
(a) What are the equilibrium points of the system? 


(b) Linearize the system around the equilibrium points, and determine 
how the stability of the linearized system depends on the parameter 
a. 


(c) Simulate the behavior of the nonlinear adaptive system to verify the 
results in part (b). 


6.12 An integrator process 
1 
G(s) = = 
is to be controlled by the error feedback law 
sU (s) = (4s + @)(U,(s) — Y(s)) 


where U(s), U.({s), and Y(s) are the Laplace transforms of the input, 
reference, and output signals, respectively. The desired response of the 
closed-loop system is given by the transfer function 


4(s + 1) 


Gm(s) = (s + 2)? 
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6.13 


6.14 


6.15 


6.16 


6.17 
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An MRAS has been designed, giving the parameter update law 


do 1 
—— = —yelt} | ——, Uel 
dr 7) (= aye “et ) 

where e = y — Ym- 

(a) Find the equilibrium parameter set of the parameter update law, giv- 
ing a parameter estimate that is constant for any reference input ue (t). 
Give an expression for the averaged nonlinear differential equation of 
the parameter update law. 

(b) Determine the local stability of the equilibrium parameter set for 
a sinusoidal reference signal u,(f) = sin øt by examining the char- 
acteristic polynomial of the linear differential equation obtained by 
linearizing the averaged differential equation around the equilibrium 


parameter set. Determine for what frequencies the linearized equa- 
tion is stable. 


Formulate the averaging equation for a discrete-time algorithm corre- 
sponding to Eq. 6.49. 


Consider discrete-time adaptive control of the system 
y(t + 1) = ay(t) + but(t) 
Derive an MRAS that gives a closed-loop system 


¥m(t + 1) = Arm (t) + bmite(t) 


Use averaging methods to analyze the system when the command signal 
is a step and a sinusoid. 


Consider Problem 6.14. Investigate the behavior of the system when the 
command signal is a step and when there is sinusoidal measurement 
noise. 


Consider the MRAS given by Eqs. (6.57). Investigate the local behavior 
of the closed-loop system when the command signal is a sinusoid and the 
gradient method 


A 


dð _ 
di ~~ yee 
is replaced with a least-squares method of the form 
dé 
ae =P pe 
dP T 
— = —Ppo' P+AP 
di pp r+ 


Show that there is no constant-gain controller that can simultaneously 
stabilize the systems G(s) = 1/(1 +s) and G(s) = 1/(1 — s). 


6.18 


6.19 


6.20 


6.21 


6.22 


6.23 


6.24 


6.25 


References 848 


Show that there is a fixed-gain controller that will simultaneously stabi- 
lize the systems G(s) = 1/(s + 1) and G(s) = 1/{s— 1). 

Consider the MRAS given by Eqs. (6.57). Make a simulation study 
to investigate the consequences of introducing leakage as described by 
Eas. (6.84) and (6.85) in the estimation algorithm. Study sinusoidal] com- 
mand signals as well as step commands and measurement noise. 


Consider the MRAS in Problem 6.4. Make a simulation study to inves- 
tigate the consequences of using conditional updating. Study sinusoidal 
command signals as well as step commands and measurement noise. 


Consider the system in Problem 6.4. Let the input be sinusoidal with 
frequency w. Investigate the effects of sinusoidal measurement noise on 
the system. 


Consider direct algorithms for control of the system 


y(t + 1) = ay(t) + bu(t) 


to give an input-output relation 


Ym{E + 1) = Amy(t) + bmtte(#) 


Investigate by simulation the convergence rates obtained when ô is fixed 
to different values. 


Investigate the behavior of the universal stabilizer in the presence of 
measurement noise. 


Consider a system for adjustment of a feedforward gain based on the MIT 
rule. Let the command signal be u,(é) = sin wt, and let G(s) = 1/(s + 1). 
Simulate the parameter behavior for the MIT rule with adaptation gains 
y = 10 and y = 11. Compare the analysis in Example 6.8. 


Consider the simulation shown in Fig. 6.11, which was performed with 
adaptation gain y = 1.0. Repeat the simulation with different adaptation 
pains. 
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CHAPTER 7 


STOCHASTIC 
ADAPTIVE CONTROL 


7.1 INTRODUCTION 


In earlier chapters the adaptive control problem was approached from a heuris- 
tic point of view. The unknown parameters of the process or the regulator were 
estimated by using real-time estimation, and the estimated parameters were 
then used as if they were the true ones. The uncertainties of the parameter 
estimates were not taken into account in the design. This procedure gives a 
certainty equivalence controller. The model-reference adaptive controllers and 
the self-tuning regulators have been derived under the assumption that the 
parameters are constant but unknown. When the process parameters are con- 
stant, the estimation routines usually are such that the uncertainties decrease 
rapidly after the estimation is started. However, the uncertainties can be large 
at the startup or if the parameters are changing. In such cases it may be im- 
portant to let the control law be a function of the parameter estimates as well 
as of the uncertainties of the estimates. 

It would be appealing to formulate the adaptive control problem from a 
unified theoretical framework. This can be done by using nonlinear stochastic 
control theory, in which the process, its parameters, and the environment are 
described by using a stochastic model. The difference compared with the treat- 
ment in the previous chapters is that the parameters of the process also are 
described by using a stochastic model. The criterion is formulated so as to min- 
imize the expected value of a loss function. It is difficult to find the controller 
that minimizes the expected loss function. Conditions for the existence of an 
optimal controller are not known. However, under the condition that a solution 
exists, it is possible to derive a functional equation by using dynamic program- 
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Nonlinear 
control law 


of hyperstate 


Figure 7.1 Block diagram of an adaptive regulator obtained from stochastic 
control theory. 


ming. This equation, called the Bellman equation, can be solved numerically 
only in very simple cases. The structure of the optimal regulator is shown in 
Fig. 7.1. The controller is composed of two parts: an estimator and a feedback 
regulator. The estimator generates the conditional probability distribution of 
the state given the measurements. This distribution is called the hyperstate 
of the problem. The feedback regulator is a nonlinear function that maps the 
hyperstate into the space of control variables. 

The structural simplicity of the solution is obtained at the price of intro- 
ducing the hyperstate, which can be a quantity of very high dimension. Notice 
that the structure is similar to that of the self-tuning regulator. The self-tuning 
regulator can be regarded as an approximation; the conditional probability dis- 
tribution is replaced by a distribution with all mass at the conditional mean 
value. In Fig. 7.1 there is no distinction between the parameters and the other 
state variables of the process. The regulator can therefore handle very rapid 
parameter variations. Furthermore, the averaging methods based on separa- 
tion of the states of the process and the parameters (used in Chapter 6) cannot 
be used to analyze the system. The optimal control law has an interesting prop- 
erty. The control attempts to drive the output to the desired value, but it will 
also introduce perturbations when the estimates are uncertain. This will im- 
prove the estimates and the future control. The optimal controller achieves a 
correct balance between maintaining good control and small estimation errors. 
This is called dual control. 

The chapter is organized in the following way. The idea with multistep 
decision problems is introduced in Section 7.2, where the two-armed bandit 
problem is introduced. A general stochastic adaptive control problem is for- 
mulated in Section 7.3, and Section 7.4 gives the derivation of the Bellman 
equation. The consequences of the structure of the solution are discussed, and 
the dual property is analyzed. Different ways to approximate the dual con- 
troller are discussed in Section 7.5. However, only very simple examples of 
dual controllers can be solved numerically, but the solutions give some useful 
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indications of how suboptimal controllers can be constructed. Some examples 
are given in Section 7.6, and the stochastic adaptive approach is summarized 
in Section 7.7. 


7.2 MULTISTEP DECISION PROBLEMS 


The idea of decision under uncertainty is discussed in this section. There are 
many situations in which decisions must be taken despite uncertainties about 
the processes or the statistics. One example is route planning, in which the 
traffic will influence the time it takes to get from one point to another. Another 
example is testing of medical drugs. In investigating the effect of a new drug, 
it is necessary to plan the test, but it is also important to have the possibility 
to go back to a standard procedure if the patient is not responding well to 
the new treatment. The characteristic features of these types of problems are 
that there are uncertainties about the possible outcome of different control 
actions. Further, there is a sequence of control actions to be taken. At each 
time, feedback is used to update or change the procedure. One of the first 
stochastic adaptive problems of this kind that was soived can be represented 
by the classical two-armed bandit (TAB) problem. This is a typical problem of 
sequential design of statistical experiments. 

The TAB problem can be described in the following way. A player is faced 
with two slot machines, I and H. If machine I is played the gain is one unit 
with probability p; machine II gives a gain of one unit with probability g. In 
the simplest case, p is known and g is unknown and is chosen before each game 
of length N according to a given probability distribution. During the game the 
unknown quantity g has to be estimated, and the player must decide at each 
step which machine to play to maximize the total gain of each game of N plays. 

The two-armed bandit problem can be used to illustrate the essential ideas 
of multistep decision problems. One strategy that can be used is open-loop 
control, that is, the control sequence is chosen without any measurements 
being made. The decision is taken with respect to the a priori knowledge about 
p and the distribution of g. In the TAB case, machine I should be played if p 
is larger than the mean value of g; otherwise, machine II should be played. A 
second strategy is what is called open-loop optimal feedback (OLOF) control. 
This controller is derived by maximizing the multistep gain function at each 
step under the assumption that no further measurements will be available, 
that is, an open-loop control sequence is determined. The first step in the 
control sequence is then used, and the performance of the system is measured. 
On the basis of the new information (feedback) a new maximization is done 
(compare the receding horizon controller in Chapter 4). The first step in the 
OLOF control is thus the same as the first step in the open-loop control. The 
measurements are thus in the TAB problem used to update the estimate of the 
unknown probability q. 
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To find the optima! solution to the TAB problem, it is possible to use dy- 
namic programming to derive the optimal strategy that maximizes the expected 
gain depending on the outcome of previous plays in the game. How this is done 
is shown for a more general case in Section 7.4. The optimal strategy for a sim- 
ple TAB problem is illustrated in the following example adopted from Yakowitz 
(1969). 


EXAMPLE 7.1 Two-armed bandit problem 


Assume that p = 0.6 and that g is uniformly distributed over the interval 
[0, 1]. If machine I is played all the time, the expected gain is 0.6 per play; 
if machine II is played all the time, the expected gain per play is 0.5. The 
open-loop strategy then suggests that machine I should be played all the time. 
However, for each game of length N there is a probability that the q has a 
larger value than 0.6. If infinitely many plays are available, the player can 
play machine I, estimate q, and then decide which machine to play for the 
rest of the plays. 

To determine the profit of knowledge of g, assume that the player is told the 
value of q before each game. The player’s optimal strategy is then to play the 
machine having the highest probability. In this case the expected gain per play 
is E{max(p, q}} = 0.68, This means that the expected gain can be increased by 
13% compared with the open-loop strategy if q is estimated. Table 7.1 shows 
the average gain per play for different values of N. As the number of plays 
increases, the gain will approach the maximum value 0.68. Relatively many 
plays are needed to get close to the optimum. 

Figure 7.2 shows the state transition diagram for the optimal strategy when 
N = 6. The player is initially in the state (0, 0) and starts to play machine 
Il to find out if machine II has better winning probability than machine Ï. 
The numbers in the circles indicate the number of times machines I and M, 
respectively, have given a gain of one unit. In states (0, 0) and (0, 1) there 
will be a switch to machine I after the player loses once; after state (0, 2) the 
optimal strategy allows one loss before switching to machine I. Oo 


Table 7.1 Average gain per play for different values of N for the two-armed 
bandit problem in Example 7.1. 


N Gain per play 


6 0.62 
10 0.64 
25 0.655 

100 0.6676 
500 0.6755 


1000 0.6773 
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Rest of game on machine I | 


Figure 7.2 The optimal strategy for the two-armed bandit problem in Ex- 
ample 7.1 when N = 6. 


7.3 THE STOCHASTIC ADAPTIVE PROBLEM 


The stochastic adaptive control problem is formulated for a simple class of 
systems by giving the class of models, the criterion, and the admissible control 
strategies. 


The Model 


Consider the discrete-time, single-input, single-output system 
y(t) + ar(t)y(t — 1) +--- + an(t)x(t ~ n) = 

bo(t)u(t —1) +--+ + by-1(Hu(é — n) + eft) (7.1) 
where y, u, and e are output, input, and disturbance, respectively. The noise 
sequence {e(f}} is assumed to be Gaussian with zero mean and variance Ro. 
Further, it is assumed that e(t) is independent of y(t— 1), y(¢—2), ..., u(t—1), 
u(t—2), ..., aj(t), a7(f—1)..., and 8;(t), b;(é-1), ... . It is further assumed that 
bo{t) Æ 0 and that the system is minimum-phase for all £. The time-varying 
parameters 


x(t) = (bo ... baalt) at) ... ant)" (7.2) 
are modeled by a Gauss-Markov process, which satisfies the stochastic differ- 
ence equation 

x(t + 1) = @x(z) + v(t) (7.3) 
where ® is a known constant matrix and {v(t)} is a sequence of independent, 
equally distributed normal vectors with zero mean value and known covariance 
R,. The initial state of the system in Eq. (7.3) is assumed to be normally 
distributed with mean value 

Ex(0) =m (7.4) 
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and covariance 
cov {x(0),x(0)} = Ro (7.5) 


It is assumed that e(t) is independent of v(t) and of x(0). 
The input-output relation of the system of Eq. (7.1) can be written in the 
compact form 
y(t) = g” (t — 1)x(t) + e(t) (7.6) 


where 


g?(¢—1)=(u(t-1) ... ult—n) -y(t-1) ... —x(t-—n)) (7) 
The model is thus defined by Eqs. (7.3) and (7.6). 


The Criterion 


It is assumed that the purpose of the control is to keep the output of the system 
as close as possible to a known reference value trajectory u,{t}. The deviation 
is measured by the criterion 


1 < 2 
Jy=E l 5 (y(t) — ue (£)) | (7.8) 


where E denotes mathematical expectation. This is called an N-stage criterion. 
The loss function should be minimized with respect to u(0), u(1), ..., u(N—1). 
The controller obtained for N = 1 is sometimes called a myopic controller, since 
it is short-sighted and looks only one step ahead. The minimizing controller 
will be very different if N = 1 or if N is large. 


Admissible Control Strategies 


To specify the problem completely, it is necessary to define the admissible 
control strategies. A control strategy is admissible if u(t) is a function of all 
outputs observed up to and including time ¢, that is, y(t), y(t—1), ... all applied 
control signals u(t — 1), ... and the a priori data. Let 9; denote all values of 
the output up to and including y(t) or, more precisely, the c-algebra generated 
by y(£), .-., y(0) and x(0). 


Discussion of the Problem Formulation 


To get a reasonable problem, it is assumed that the noise in Eq. (7.1) is of 
least-squares type, that is, C(q) = q”. Further, there is no extra time delay in 
the system. In the formulation it has been assumed that the measurements 
y(t} are obtained at each sampling interval. It is possible to define other control 
problems leading to other controllers by changing the way in which the future 
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measurements will become available. The realism of the assumption that @ is 
known in Eq. (7.3) is open to question. The case $ = J can, however, be used 
as a generic case to study the dual control problem. The process of Eq. (7.1) is 
a nonlinear model, since the parameters as well as the old inputs and outputs 
are the states of the system. Notice for instance that the distributions of the 
parameters and the disturbances are Gaussian but y(£} is not Gaussian. The 
problem could also be phrased in more general terms by assuming that both 
the model and the criterion are general nonlinear functions. In this chapter we 
consider the special case defined by Eqs. (7.1) and (7.8) to illustrate the ideas 
and the difficulties with the stochastic adaptive approach. 
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We now analyze the problem formulated in Section 7.3. The problem of estimat- 
ing the parameters of Eq. (7.1) is first considered. The control problem is first 
solved for the case in which the parameters are known. The problem is then 
solved for the case in which N = 1 in the criterion of Eq. (7.8). The solution 
of the complete problem is finally discussed. The control problem is solved by 
using dynamic programming. 


The Estimation Problem 


To solve the dual control problem, it is necessary to be able to evaluate the 
influence of the control signal on the future outputs and to estimate and predict 
the behavior of the stochastic parameters. The estimation problem is defined so 
as to compute the conditional probability distribution of the parameters, given 
the measured data. The system is written in a standard state space form, using 
Eqs. (7.3) and (7.6). The conditional distribution of x(t + 1), given 9%, is given 
by the following theorem. 


THEOREM 7.1 Conditional distribution of the states 


Consider the model of Eq. (7.3) with the output defined by Eq. (7.6), where 
e(t) and v(t} are independent zero mean Gaussian variables with covariances 
Rz and Ri, respectively. The initial state of the system is given by Egs. (7.4) 
and (7.5). 

The conditional distribution of x(t), given J_1, is Gaussian with mean £(2) 
and covariance P(t) satisfying the difference equations 


£(E + 1) = R(t) + K (8) (y) — 97 (E— 1/2) 
P(t +1) = (®— K(#)g7(t—1)) P(t)O? + Ri (7.9) 
K(t) = ®P(t)g(t — 1) (Ro + 97 (¢-1) P(t) 9-1) 
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with the initial conditions 

£(0) =m 

P(0) = Ro 
Furthermore, the conditional distribution of y(#), given 9-1, is Gaussian with 
mean value 

my(t} = g” (t ~ 1)2(2) 
and covariance 
02(t) = Ra + 97 (t—1)P()e(t-1) 


Proof: If g(t—1) is a known time-varying vector, then the theorem is identical 
to the Kalman filtering theorem, which can be found in standard textbooks on 
stochastic control. Going through the details of the proof of the Kalman filtering 
theorem, we find that it is still valid, since g(t — 1) is a function of Yı. In 
other words, the vector ¢(¢—1) is not known in advance, but it is known when 
it is needed in the computations. oO 


Remark. Notice that the conditional distribution of y(t}, given %_1, is Gaus- 
sian even if y(t} is not Gaussian. Oo 


The estimation problem is thus easily solved for the model structure chosen. 
The conditional distribution of the state of the system is called the hyperstate. 
The distribution is Gaussian in the problem under consideration. It is then 
sufficient to consider the mean and covariance of x(t). Further, some of the 
old inputs and outputs must be stored to compute the distribution defined 
in Eqs. (7.9). In the problem under consideration the hyperstate is finite- 
dimensional and can be characterized by the triple 


é(t)=(P(%—1) (£) P(t)) (7.10) 
where 
g (t—1)}=(0 u(t—2) ... u(t—n}) —y(t—1) ... —y(t—-n)) (711) 


The vector @? (t—1) is the same as g7 (t—1), except that u(t—1) is replaced by a 
zero. The updating of the hyperstate is given by Theorem 7.1 and the definition 
of @7 (t — 1). In the general case the conditional probability distribution is not 
Gaussian. This will considerably increase the computational difficulties and 
the storage requirements. 


Systems with Known Parameters 


If the parameters of the system of Eq. (7.1) are known, it is easy to determine 
the optima) feedback. The vector ëT defined by Eq. (7.11) is used to show the 
dependence of u(t}: 
y(t +1) = 9? (t)x(t +1) + e(t + 1) 
= bo(t + 1)u(t) + G7 (é)x(t + 1) + e(t + 1) 
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The optimal feedback when bo(t + 1) and x(t + 1) are known is then given by 
_ gt 
u(t) = u(t +1) — @* (t)x(t + 1) 
boft + 1) 
Notice that @() is a function of the admissible data. This controller gives 
y(€+1) =u,(¢ +1) + eft +1) 


and it minimizes Eq. (7.8), since e(t + 1) is independent of Y, and u(t). The 
minimal loss is given by 


(7.12) 


mindy = Re 


Notice that it is necessary to assume that b(t + 1) Æ 0 and that the system is 
minimum-phase at every instant of time. The control signal may otherwise be 
unbounded. 


Certainty Equivalence Control 


When the parameters x(t + 1) are not known, it can be tempting to replace 
Eq. (7.12) with 
— ar 7 
u(t) = Ue{t + 1) g (Hl + 1) 
boft + 1) 

The true parameter values are replaced by the expected values, given 97. The 
controller of Eq. (7.13) is called the certainty equivalence controller. Certainty 
equivalence control is the strategy used in the self-tuning regulators in Chap- 
ters 3 and 4 and in the model-reference adaptive systems in Chapter 5. In 
these controllers it was also necessary to ensure that by # 0. 


(7.13) 


Cautious Control 


We now consider the special case in which N = 1 in Eq. (7.8). According to 
Theorem 7.1 the conditional distribution of y(¢+ 1), given %, is Gaussian with 
mean p? (¢)2(¢ + 1) and covariance Re + p7 (£)P(t + 1)p(t). Then 


E { (y(t + 1) -ult +1)? 1%} 
= (97 (t)2(t + 1) —ue(t + 1)” + 97 EPE + Delt) + Ro 


= (97 (A(t + 1) + bolt + Dutt) — velt + D) 


+ ÕT(E)PE + 1) G(E) +u? (t)po, (t+ 1) 
+ 2u(t)G? (P(t +1) + Re (7.14) 
The first equality is obtained by using the standard formula that 


E(P) =m? +p 
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when ¢ is a Gaussian variable with mean m and variance p. The column vector 
£ selects the first column of the matrix P(z), that is, 

éP=(1 0... 0) 
Further, pa, is the covariance of the parameter estimate 6). Equation (7.14) 


is quadratic in u(t). Minimization of Eq. (7.14) with respect to u(t) gives the 
admissible one-step optimal controller 


) bo(t + Lue (t + 1) — ëT (8) (bot + 1)2(t + 1) + P(é+ 132) 
ud ee l 


B2(t + 1) + Palt + 1) a 
The minimum value of the loss function is 
min E { (y(t + 1) = u(t + 1) 1%} 
= (9" (D(t + 1) — u(t + 1)) + Ro + G7 P(E + HOC) (7.16) 


(bott + 1)ue(t +1) — 67 (t) (Go(t + 1&(t+1) + P(t+ DA) 


The one-step-ahead controller, or cautious controller, of Eq. (7.15) differs from 
Eq. (7.18) because the uncertainties of the parameter estimates are also taken 
into account. The controller becomes cautious when the estimates are uncer- 
tain. Notice that the cautious controller of Eq. (7.15) reduces to the certainty 
equivalence controller of Eq. (7.13) when P(t + 1) =0. 


EXAMPLE7.2 Integrator with time-varying gain 
Consider an integrator in which the gain is changing. Let the process be 
described by 
y(t) — x(t — 1) = b(t)u(t — 1) + e(t) 
where 
b(t + 1) = pab(t) + Riv(t) 


The errors e and v are zero-mean Gaussian white noise with the standard 
deviations R and 1, respectively. Further, it is assumed that ue = 0. The 
certainty equivalence controller is given by 


1 
t) = —.———_ yi E 
u(t) TOEN y(t) 
and the cautious controller is 
blt +1) 


ue) ~ B2(t4+ 1) 4+ pelt +2) 


y(t) 
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The gain in the cautious controller has been reduced by a factor 


52 
b2 + Ps 
compared with the certainty equivalence controller. Notice that the gain ap- 
proaches zero when the uncertainty increases. o 


Multistep Optimization 


The general multistep optimization problem can be solved by using dynamic 
programming. The fact that the conditional distributions are Gaussian will 
simplify the problem. 

It follows from a fundamental result of stochastic control theory (see 
Åström (1970), Lemma 8:3.2) that 


u{t—1) J eu(N- 1) s{ (y(k) — ue(k)) | 
= Ey [ming [5 (Ik) — we(k))? |Y \) 


and it is assumed that the minimum exists. E(- |%-1) is a function of the 
hyperstate of Eq. (7.10) and t. Define 


N 
V (E(t), t) = YO (k) — we(R))? in} 


u(t) ut _u(N—1) ef ~ 


V (č(£), £) can be interpreted as the minimum expected loss for the remaining 
part of the control horizon given the data up to t — 1. 

Consider the situation at time N — 1. When u(N — 1) is changed, only 
y(N) will be influenced. This means that we have the same situation as for 
the one-step minimization. From Eq. (7.16) we get 


V (E(N), N) 
= (ØT(N — 1)(N) —u,(N))” + Re + 67 (N — 1)P(N)G(N — 1) 
_ (Bol ue) -97N — 1) (BNIN) + PENYE) )' 
ESIN) + Pa (N) 
At time N — 1 we get 
V (E(N —1), N~1) 
= min E{(9(N — 1) — ue(N -1)) + V (E(N), N) |%n-a} 


u(N—2) 
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Notice that the minimization is done only over u(N — 2), since u(N — 1) was 
eliminated in the previous minimization. This recursively defines the loss at 
time N — 1, which then can be used for iteration backwards one more step 
of time, and so on. This dynamic programming procedure leads to a recursive 
equation, which defines the minimum expected loss. At time £ we get 


V EC), £) = min E { (9) -u + VEE, t+ 1a} (1) 


This functional equation is called the Bellman equation of the problem. The 
simplicity of the form of Eq. (7.17) is misleading. The equation cannot be 
solved analytically, but it requires extensive numerical computations to get 
the solution even for very simple problems. 

The first term on the right-hand side of Eq. (7.17) can be evaluated in 
the same way as in the one-step minimization. The second term causes the 
difficulties in the optimization, since we have to evaluate 


E{V (E(t + 1), t+ 1) |Y} 


The average with respect to the distribution of y(t), given 4,1, must be com- 
puted. According to Theorem 7.1 this distribution is Gaussian with mean m,(é) 
and variance o7(f). This gives 


E {V (E(t + 1), £+ 1) |91} 


co 
1 zZ 2 

= V (A(t), R(¢ + 1), P +1), t+ 1e Eme) ds (7.18 
zzl (Gt), 2E +1), PE+1),t+1)e s (7.18) 


where 
R(t +1) = ORE) + E(t) (s— g7 (€— 1)2(t)) 
P(t+ 1) = (®—K(He" (t — 1)) P(T + Ri 
K(t) = ®P(t) g(t ~ 1)/o5(4) 
o?(t) = Ro + 97 (t— 1)P()e(t - 1) 


Pilt) = u(t —1) 
i(t) = @1(t — 1) E=2,...,n,N+2,..., 2n 
Øn+1 (t) =8 

These equations, together with Eq. (7.18), can be used to compute recursively 
the control signal and the loss as functions of the hyperstate. The control 
variable u(t — 1) influences the immediate loss (i.e., the first term on the 
right-hand side of Eq. (7.17)). Notice that u(t — 1) also influences the expected 
future loss, since it influences g(¢— 1), which influences £(t+1), P(t+ 1), and 
T(t). This means that the choice of the control signal u(t — 1) influences the 
immediate loss, the future parameter estimates, their accuracy, and also the 
future values of the output signal. The optimal controller is a dual controller. 
It makes a compromise between the control action and the probing action. 
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The probing action will add an active learning feature to the controller, 
in contrast to the cautious and certainty equivalence controllers, in which the 
learning is “accidental.” The optimal feedback will generate control actions 
that will improve the accuracy of the future estimates at the expense of the 
short-term loss. The cautious controller obtained when N = 1 will not benefit 
if probing is introduced; it only tries to make the loss as small as possible at 
the next instant of time. l 


Separation and Certainty Equivalence 


The optimal one-step controller of Eq. (7.15) cannot be obtained by using the 
certainty equivalence principle, but the estimation and the control problems 
can be separated, As was mentioned in Section 7.1, most adaptive controllers 
are based on the hypothesis that the certainty equivalence principle can be 
used. The derivations in this section show that the separation principle also 
can be used in the considered problem. However, the uncertainties must also be 
used in the computation of the control signal. It is thus of interest to investigate 
whether there are classes of systems for which the certainty equivalence and 
separation principles hold. 

One case in which the certainty equivalence principle holds is the cele- 
brated linear quadratic Gaussian case for known systems. For adaptive con- 
trollers there are very few cases to which the certainty equivalence principle is 
applicable. One exception is when the unknown parameters are stochastic vari- 
ables that are independent between different sampling intervals. The certainty 
equivalence principle also holds for the stochastic linear quadratic problem for- 
mulation, when the process noise is white but not necessarily Gaussian and 
when the measurement noise is additive but not necessarily white. 

The separation principle is valid for much more general cases, The cautious 
controller and the dual controller derived in this section are obtained by using 
separation. 


Numerical Solution 


Even in the simplest cases there is no analytic solution to the Bellman equa- 
tion (Eq. 7.17). It is therefore necessary to resort to numerical solution, One 
iteration of Eq. (7.17) involves 


e Discretization of the loss V in the variables of the hyperstate, 

e Evaluation of the integral in Eq. (7.18) using a quadrature formula, and 

e Minimization over u(t — 1) for each combination of the discretized hyper- 
State. 


Both V and u are functions of the hyperstate, so the storage requirements 
increase rapidly when the order of the system increases. Assume that the 
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dimension of the hyperstate is 2 and that each variable is discretized into 
ten steps. Thus the loss and control tables contain 100 values each. Let the 
hyperstate have dimension 6, and let each variable be discretized in ten steps. 
The dimension of the loss and control tables is then 10° each. This means that 
only very simple problems have been solved numerically because of the “curse 
of dimensionality.” 


Discontinuity of the Control Signal 


A feature of the optimal solution is that the control law can become discontin- 
uous in situations such as that shown in Fig. 7.3. The figure shows the loss 
function as a function of the control signal for three different but close values of 
the hyperstate. If there are several local minima, the control signal can become 
discontinuous when the global minimum changes from one local minimum to 
another. This can be interpreted as a change of mode for the controller. For 
instance, the controller may introduce probing to increase the knowledge of 
the unknown parameters. 


7.5 SUBOPTIMAL STRATEGIES 


The optimal multistep dual controller derived in Section 7.4 is of little practical 
use, because the numerical computations limit its applicability. The dual struc- 
ture of the controller is very important, however. Many ways to make practical 
approximations have been suggested; this section surveys some of the possi- 
bilities. The properties of the cautious controller are first investigated, and 
different ways to improve this controller are then discussed. 


Cautious Controllers 


ub 


Figure 7.3 Ilustration of how several local minima of the loss function can 
give a discontinuity in the control signal. The global minima are marked with 
dots. On the middle curve the local minima have the same value. 
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Figure 7.4 Representative simulation when an integrator is controlled by 
using à cautious controller. Turn-off occurs when the control signal is small. 


Minimization over only one step leads to the one-step or cautious controller of 
Eq. (7.15). This controller takes the parameter uncertainties into account, in 
contrast to the certainty equivalence controller of Eq. (7.13). However, the gain 
of Eq. (7.15) will decrease if the variance of dy increases. This will give less 
information about bo in the next step, and the variance will increase further. 
The controller is then caught in a vicious circle, and the magnitude of the 
control signal becomes very small. This is called the turn-off phenomenon. 


EXAMPLE7.3 Turn-off 


Consider the integrator with unknown gain in Example 7.2 with R = 0.09 
and p, = 0.95. Figure 7.4 shows a representative simulation of the cautious 
controller. The control signal is small for periods of time, and the variance of 
the estimated gain increases during the turn-off. After some time the control 
activity suddenly starts again. o 


The turn-off will generally start when the control signal is small and 
when the parameter bo is small. The problem with turn-off makes the cautious 
controller unsuitable for control of systems with quickly varying parameters. 
The cautious controller can be useful if the parameters of the process are 
constant or almost constant, but the certainty equivalence controller with some 
simple safety measures can often be used in such cases also. 
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Classification of Suboptimal Dual Controllers 


The problem of turn-off has led to many suggestions of how to derive controllers 
that are simple but still have some dual features. Some ways are: 

« Adding perturbation signals to the cautious controller, 

s Constraining the variance of the parameter estimates, 

e Extensions of the loss function, and 

« Serial expansion of the loss function. 
Some of these modifications are now discussed. 


Perturbation Signals 


The turn-off is due to lack of excitation (compare Chapter 6). One way to in- 
crease the excitation is to add a perturbation signal. Pseudo-random binary 
sequences (PRBS) and white noise signals have been suggested. The pertur- 
bation can be added all the time or only when the variance is exceeding some 
limit. The addition of the extra signal will naturally increase the probing loss 
but may make it possible to improve the total performance. 


Constrained One-Step Minimization 


One class of suboptimal dual controllers is obtained by constrained one-step 
minimization. Suggested constraints are 

e Limitation of the minimum value of the control signal and 

« Limitation of the variance. 
One method is to choose the control as 


u(t) = { 


This will give an extra probing signal if the cautious controller gives too small 
an input signal. 

Different ways to constrain the minimization by using the P-matrix have 
been suggested. For instance, the one-step loss of Eq. (7.14) can be minimized 
under the constraint that 


Ulim ` sign (Ucautious} if [2Ucantious | < [tum] 


Uecautious if [tcantious | > [tim] 


t P` (t+2)> M 
P~ is proportional to the information matrix. The constraint on the trace of 
P- means that the information about the parameters is always larger than 
some chosen value M. A similar approach is to constrain only the variance of 
bo to 
yalt + 2) if pe (t+ 1) < (E + 1) 


Pp (¢+2)< 
ool ne otherwise 
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These modifications of the cautious controller have the advantage that the con- 
trol signal can be easily computed, but the algorithms will contain application- 
dependent parameters that have to be chosen by the user. Finally, the approx- 
imations will not prevent the turn-off. The extra perturbation is not activated 
until the turn-off occurs. 


Extensions of the Loss Function 


An approach that is similar to constrained minimization is to extend the loss 
function to prevent the shortsightedness of the cautious controller. One obvious 
way is to try to solve the two-step minimization problem. The derivation in 
Section 7.4 shows that it is not possible to get an analytic solution when N = 2 
in Eq. (7.8). 

Another approach is to extend the loss function with a function of P(t + 2), 
which will reward good parameter estimates. The following loss function can 
be used: 


min E {(y(¢-+1) -ult + D) + Pf PEDINA} (7.19) 
where p is a fixed parameter. Since the crucial parameter is bo, we can use 


f (P(t +2)) = Palt + 2) 
or 


_ p, Pht +2) 
This leads to a loss function with two local minima; it is necessary to make a 
numerical search for the global minimum. It is possible to utilize the structure 
of the problem and make a serial expansion up to second order of the loss 
function. The expansion gives a simple noniterative suboptimal dual controller 
in which the increase in computations compared with a self-tuning or cautious 
regulator is very moderate. 
Two similar approaches are to modify the loss functions to 


min E {ot $1)—u,(t+ 1)? — pret Bai (7.21) 
and 
min E { (y(t + 1) — ue(t + 1)) — pe (t+ 1) im} (7.22) 


respectively. The innovation €(f + 1) is defined as 
e(t +1) =9(t +1) -o ORF 1) 


Both these loss functions lead to quadratic criteria that make it possible to 
derive simple analytic expressions for the control signal. 
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Serial Expansion of the Loss Function 


The suboptimal dual controllers discussed above have been derived for the 
input-output model of Eq. (7.1). Suboptimal dual controllers have also been 
derived for state space models. One approach is to make an expansion of the 
loss function in the Bellman equation. Such an expansion can be done around 
the certainty equivalence or the cautious controllers. This approach has mainly 
been used when the control horizon N is rather short, usually less than 10. 
One reason is the quite complex computations that are involved. 


Summary 


There are many ways to make suboptimal dual controllers. Most of the ap- 
proximations that are discussed start with the cautious controller and try to 
introduce some active learning. This can be done by including a term in the 
loss function that reflects the quality of the estimates. This term should also be 
a function of the control signal that is going to be determined. The suboptimal 
controllers should also be such that they can be used for higher-order systems 
without too much computation. 


7.6 EXAMPLES 


Some examples are used to illustrate the properties of the controllers discussed 
in this chapter. 


EXAMPLE74 Optimal dual controller 
The first example is a numerically solved dual control problem from Åström 
and Helmersson (1982). Consider the integrator in Example 7.2. The gain is 
assumed to be constant but unknown, that is, pa = 1 and #, = 0. It is assumed 
that the parameter 6 is a random variable with a Gaussian prior distribution; 
the conditional distribution of b, given inputs and outputs up to time ż, is 
Gaussian with mean (t) and covariance P(t). The hyperstate can then be 
characterized by the triple (y(t), 5(t), P(t)). The equations for updating the 
hyperstate are given by Eqs. (7.9). 
Define the loss function 


HN 
Vy -mined > reiaa 


where Y, denotes the data available at time t, that is, {y(t), y(£— 1), ...}. By 
introducing the normalized variables 


n=yiJ/R, B=b/VP  p=-ub/y 
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it can be shown that Vy depends on 7 and f only. Further introduce the 
normalized innovation 


y(t +1) — y) — but) 
Re + u(t)? P(t) 


For Rz = 1 the Bellman equation for the problem can be written as 


Va (n, P) = min Un(n, BH) 


e(t) = 


where 
Vo(7, B) =0 
and 


Un(0, p, 4) =1+77(1— u)? + a + f Vy-1(Np, Bp)O(s) ds 


where ¢ is the normal probability density with zero mean and unit variance 
and 


Notice that 7, and fp are the one-step-ahead predicted values of 7 and $. 
When the minimization is performed, the control law is obtained as 


EN (N B) = arg min Uxn(N, B, H) 
The minimization can be done analytically for N = 1, giving 


#1(7), $) = arg min € +7- u) + ET) =F rE 


The original variables give 


ee P+D) 
BCED B(t4+1)4+ P(e+1)” 


This control law is the one-step control, or myopic control, derived in Exam- 
ple 7.2, 

For N > 1 the optimization can no longer be done analytically, Instead, 
we have to resort to numerical calculations. Figure 7.5 shows the dual control 
laws obtained for different time horizons N. The discontinuity of the control 
law corresponds to the situation in which a probing signal is introduced to 
improve the estimates. 


u(t) = 


(¢) 
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(a) 


(c) 


Figure 7.5 Illustration of the cautious control and dual control laws when 
(a) N = 1; (b) N = 3; (c) N = 6; and (d) N = 31. The control signal is shown 
as a function of 7/ = n/{(1 +7) and p = P0 + p°). 


The certainty equivalence controller 


u(t) = —y(t)/6 
can be expressed as 
p=1 


in normalized variables. Notice that all control laws are the same for large 
Ê. that is, if the estimate is accurate. The optimal control law is close to the 
cautious control for large control errors. For estimates with poor precision and 
moderate control errors, the dual control gives larger control actions than the 
other control laws. 

The optimal dual controller has been computed on a Vax 11/780. The 
normalized variables 7 and p are discretized into 64 values each. The control 
table and the loss function table are thus of dimension 64 x 64. One iteration 
of the Bellman equation takes about 6 hours of CPU time. o 
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EXAMPLE75 Probing 


An interesting feature of the dual control law is that it behaves quite differently 
from the heuristic algorithms. The most significant feature is the probing 
that takes place to gain more information about the unknown parameters. 
The effect of probing is most significant when the output y is small. Probing 
can be illustrated by using the results of Example 7.4. Both the cautious and 
certainty equivalence control laws are continuous in y and zero for y = 0. 
However, the dual control law is very different. To show this, consider the 
control signal for y = 0—. Figure 7.6 shows the control signal for y = 0— asa 
function of the normalized parameter precision, 8 = b /VP, for different time 
horizons. All control laws give zero control signal when the parameter estimate 
is reasonably precise. However, for uncertain estimates, the contro] signal is 
different from zero, and the transition is discontinuous, This discontinuity can 
be used to define a probing zone. Notice that the probing zone increases with 
increasing time horizon. For N = 31, probing occurs when p < 1.3, that is, 
when 6 < 1.3VP. = 


T EXAMPLE 7.6 | Time-varying parameters 


The system in Examples 7.2 and 7.3 will now be controlled by a suboptimal 
dual controller that minimizes Eq. (7.19), with f(P(t +2)) given by Eq. (7.20). 
(See Wittenmark and Elevitch (1985).) Figure 7.7 shows the same experiment 
using the same noise sequences as in Fig. 7.4. With the suboptimal dual 
controller there is no tendency toward turn-off. The simulation in Fig. 7.7 
shows that the suboptimal] dual controller is much better than the cautious 
controller. Comparisons using Monte Carlo simulations have also been done 
with the numerically computed optimal dual controller. The result is that the 
suboptimal dual controller is as good as the numerically computed optimal 
controller. A summary of some simulations is shown in Fig. 7.8, which shows 


Figure 7.6 Control signal as a function of the normalized parameter preci- 
sion J for optimal control laws for different time horizons. 
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Figure 7.7 Simulation of the integrator with time-varying gain using a 
suboptimal dual controller. Compare Fig. 7.4. 


mean values and standard deviations of the loss when the standard deviation 
of the parameter noise R; is changed. It is assumed that 


pe = V1—-fy 


For v Rı = 0.003 there is good agreement between the suboptimal controller 
and the optimal dual controller that was derived under the assumption that 
Kı = 0. The optimal dual controller from Example 7.4 corresponds to A; = 0. 
The controller obtained has been used also for \/Ry = 0.003. 

In Eg. (7.20), R2/ pa} (t + 1) is used as a normalization factor in the term 
added to the loss function. The reason is an attempt to preserve a property of 
the dual optimal controller. In Example 7.4 the loss Vy was a function of the 
normalized variables 7 and J. The loss function of Eq. (7.19) with Eq. (7.20) 
will also have this property for the integrator example. Simulations indicate 
that the normalization in Eq. (7.20) also makes the choice of p less crucial. O 


7.7 CONCLUSIONS 


Optimal multistep controllers have been derived by using stochastic contral 
theory. The solution is defined through the Bellman equation. This functional 
equation is difficult to solve even for very simple systems. The optimal solution 
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0.003 0.1 0.2 0.3 0.4 0.5 p 


Figure 7.8 The mean values and standard deviations of the losses for Monte 
Carlo runs with different. values of yR, for the system in Example 7.6. (a) 
Cautious controller. (b} Suboptimal dual controller from Wittenmark and Ele- 
vitch (1985). (c) Numerically computed optimal dual controller from Astrém 
and Helmersson (1982). 


has some interesting properties; it makes a compromise between good control 
and good estimation by introducing probing actions. This dual effect is of great 
importance, since it introduces an active learning feature into the controller. 
It is important to preserve this dual feature when suboptimal controllers are 
considered. The cautious or one-step controller does not have any active learn- 
ing; the control may instead be turned off when the parameter uncertainties 
become too large. One important question is whether it is worth the effort 
to look at more elaborate control structures than the certainty equivalence 
controllers. The self-tuning regulators perform very well, as can be seen in 
Chapters 3, 4, and 6. The extra computations are not too extensive in several 
of the suboptimal dual controllers discussed in Section 7.5. This indicates that 
active learning can easily be included. 

There are two situations in which dual control can pay off. One is when 
the time horizon is very short, in which case it is important to get good 
parameter estimates immediately. Areas in which this is the case are economic 
systems and control of missiles. The other situation in which dual features 
are important is when the parameters are varying rapidly and when the bo 
parameter can change sign, as in the simulations given in Section 7.6. Grinders 
are one type of physical process in which the gain may change sign. Grinders 
are common, for instance, in the mining, pulp, and paper industries. 

Even if the optimal] dual controller is impossible to calculate for realistic 
processes, it gives important hints about how to make sensible modifications 
of certainty equivalence and cautious controllers. 
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PROBLEMS 


7.1 


7.2 


7.3 


7.4 


7.5 


7.6 


V7 


Discuss possible difficulties of extending the problem given in Section 7.3 
to the case in which the system in Eq. (7.1) has an additional time delay. 


Show that the cautious controller of Eq. (7.15) minimizes the loss function 
of Eq. (7.14) and that the minimum value of the loss function is Eq. (7.16). 


Consider the process in Example 7.2, but with a constant but unknown 
gain b. Calculate and compare the minimum values of the loss function 
when 

(a) the parameter b is known (ie., the minimum-variance controller), 
(b) the certainty equivalence controller is used, 

(c) the cautious controller is used. 


Compute the suboptimal control law that minimizes the loss function of 
Eq. (7.21). (Hint: See Goodwin and Payne (1977), p. 296.) 


Compute the suboptimal control law that minimizes the loss function of 
Eq. (7.22). (Hint: See Milito et al. (1982).) 


Assume that the process is described by one of the known models 
y(t) = (8; + e(t) i=1,...,m 


but it is not known which is the correct one. Let the initial information be 
described by the probabilities p; = P(@ = 8;}. Formulate the dual control 
problem and discuss the computational difficulties associated with the 
solution. 


Discuss the consequences of formulating the dual control problem for the 
model 
x(t +1) = &(é)x(4) + Tut) 
y(t) = C(t) x(t} + e(t) 
where ®, T, and C contain some unknown parameters. For simplicity, 


consider the case in which the system is given in controllable canonical 
form, that is, 


—ay(t) —ag(f) ... —an(é) 
0 wee 0 
@{t) = 
0 see 1 0 
rr () = (bolt) .-.  bn-1(4)) 


Cié)=(1 0 ... 0) 
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CHAPTER 8 


AUTO-TUNING 


8.1 INTRODUCTION 


Adaptive schemes like MRAS and STR require a priori information about the 
process dynamics. It is particularly important to know time scales, which are 
critical for determining suitable sampling intervals and filtering. The impor- 
tance of a priori information was overlooked for a long time but became appar- 
ent in connection with the development of general-purpose adaptive controllers. 
Several manufacturers were forced to introduce a pre-tune mode to help in ob- 
taining the required prior information. The importance of prior information 
also appeared in connection with attempts to develop techniques for automatic 
tuning of simple PID regulators. Such regulators, which are standard building 
blocks for industrial automation, are used to control systems with a wide range 
of time constants. 

From the user’s point of view it would be ideal to have an auto-tuning 
function in which the regulator can be tuned simply by pushing a button. 
Although conventional adaptive schemes seemed to be ideal tools to provide 
automatic tuning, they were found to be inadequate because they required 
prior knowledge of time scales. Special techniques for automatic tuning of 
simple regulators were therefore developed. These techniques are also useful for 
providing pre-tuning of more complicated adaptive systems. In this chapter we 
describe some of these techniques. They can be characterized as crude robust 
methods that provide ballpark information. They are thus ideal complements 
to the more sophisticated adaptive methods. An overview of industrial PID 
controllers with auto-tuning is given in Section 12,3. 

The chapter is organized as follows: The standard PID controller is dis- 
cussed in Section 8.2. Different auto-tuning techniques are given in Section 
8.3. Transient and frequency response methods for tuning are developed in 
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Sections 8.4 and 8.5, respectively, and Section 8.6 is devoted to analysis of 
relay oscillations. Conclusions are presented in Section 8.7. 


8.2 PID CONTROL 


The PID controllers are the standard tool for industrial automation. The flexi- 
bility of the controller makes it possible to use PID control in many situations. 
The controllers can also be used in cascade control and other controller con- 
figurations. Many simple control problems can be handled very well by PID 
control, provided that the performance requirements are not too high. The PID 
algorithm is packaged in the form of standard regulators for process control 
and is also the basis of many tailor-made control systems. The textbook version 
of the algorithm is 


u(t) = K, [o + 7 J e(s)ds + Ty 3 (8.1) 


0 


where u is the control variable, e is the error defined as e = u, — y where 
te is the reference value, and y is the process output. The algorithm that is 
actually used contains several modifications. It is standard practice to let the 
derivative action operate only on the process output. It may be advantageous 
to let the proportional part act only on a fraction of the reference value. The 
derivative action is replaced by an approximation that reduces the gain at high 
frequencies. The integra! action is modified so that it does not keep integrating 
when the control variable saturates (anti-windup). Precautions are also taken 
so that there will not be transients when the regulator is switched from manual 
to automatic control or when parameters are changed. 

If the nonlinearity of the actuator can be described by the function f, a 
reasonably realistic PID regulator can be described by 


u(t) = f(v(t)) 


v(t) = P(t) + I(t) + DE) (8.2) 
where 
P(E) = Ka (Bult) — 90) 
TF = T luelt) - 0) + FO) — ul) 63) 
Ta dD dy 


The last term in the expression for dI /dt is introduced to get anti-windup when 
the output saturates. This guarantees that the integral part I is bounded. The 
parameter T, is a time constant for resetting the integral action when the 
actuator saturates. The essential parameters to be adjusted are K}, T;, and Tq. 
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The parameter N can be fixed; a typical value is N = 10. The tracking time 
constant is typically a fraction of the integration time 7;. 


8.3 AUTO-TUNING TECHNIQUES 


Several ways to do auto-tuning have been proposed. The most common method 
is to make a simple experiment on the process. The experiment can be done 
in open loop or closed loop. In the open-loop experiments the input of the 
process is excited by a pulse or a couple of steps. A simple process model, for 
instance of second order, is then estimated by using recursive least squares 
or some other recursive estimation method. If a second-order process model 
is estimated, then the PID controller can be used to make pole placement. 
The speed and the damping of the system are then the design parameters. A 
popular design method is to choose the controller zeros such that they cancel 
the two process poles. This gives good responses to setpoint changes, while the 
response to load disturbances is determined by the open-loop dynamics. The 
transient response method for automatic tuning of PID regulators is used in 
products from Yokogawa, Eurotherm, and Honeywell. It is used for pre-tuning 
in adaptive controllers from Leeds and Northrup and Turnbull Control. 

The tuning experiments can also be done in closed loop. A typical example 
of this is the self-oscillating method of Ziegler and Nichols or its variants. The 
relay auto-tuner based on self-oscillation is used in products from SattControl 
and Fisher-Rousemount. In these regulators the tuning is initiated simply by 
pushing the tuning button. One advantage of making experiments in closed 
loop is that the output of the process may be kept within reasonable bounds, 
which can be difficult for processes with integrators if the experiment is done 
in open loop. 

The auto-tuning function is often a built-in feature in standard stand-alone 
PID controllers. Automatic tuning can also be done by using external equip- 
ment. The tuner is then connected to the process and performs an experiment, 
usually in open loop. The tuner then suggests parameter settings, which are 
transferred to the PID controller either manually or automatically. Since the 
external tuner must be able to work with PID controllers from different man- 
ufacturers, it is important that the tuner have detailed information about the 
implementation of the PID algorithm in specific cases. 

Another method for auto-tuning is to use an expert system to tune the 
controller. This is done during normal operation of the process. The expert 
system waits for setpoint changes or major load disturbances and then evalu- 
ates the performance of the closed-loop system. Properties such as damping, 
period of oscillation, and static gain are estimated. The controller parameters 
are then changed according to the built-in rules, which mimic the behavior of 
an experienced control engineer. Pattern recognition or expert system is used 
in controllers from Foxboro and Fenwal. 
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84 TRANSIENT RESPONSE METHODS 


Several simple tuning methods for PID controllers are based on transient. 
response experiments. Many industrial processes have step responses of the 
type shown in Fig. 8.1, in which the step response is monotonous after an 
initial time. A system with a step response of the type shown in Fig. 8.1 can 
be approximated by the transfer function 


G(s) =“ e! (8.4) 


where k is the static gain, L is the apparent time delay, and T is the apparent 
time constant. The parameter a is given by 
L 


a=k F (8.5) 


The Ziegler-Nichols Step Response Method 


A simple way to determine the parameters of a PID regulator based on step 
response data was developed by Ziegler and Nichols and published in 1942. 
The method uses only two of the parameters shown in Fig. 8.1, namely, a 
and L. The regulator parameters are given in Table 8.1. The Ziegler-Nichols 
tuning rule was developed by empirical simulations of many different systems. 
The rule has the drawback that it gives closed-loop systems that are often too 
poorly damped. Systems with better damping can be obtained by modifying 
the numerical values in Table 8.1. By using additional parameters it is also 
possible to determine whether the Ziegler-Nichols rule is applicable. If the 
time constant T is also determined, an empirical rule is established that the 


Figure 8.1 Unit step response of a typical industrial process. 
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Table 8.1 Regulator parameters obtained by the Ziegler-Nichols step re- 
sponse method. 


Controller K, T; Ta 


P l/a 
PI 0.9/a 8L 
PID 1.2/e 2L L/2 


Ziegler-Nichols rule is applicable if 0.1 < L/T < 0.6. For large values of L/T it 
is advantageous to use other tuning rules or control laws that compensate for 
dead time. For small values of L/T, improved performance may be obtained 
with higher-order compensators. It is also possible to use more sophisticated 
tuning rules based on three parameters. 


Characterization of a Step Response 


The parameters k, L, and T can be determined from a graphical construction 
such as the one indicated in Fig. 8.1. It may be useful to take averages of 
several steps if the signals are noisy. There are also methods based on area 
measurements that can be used. One method of this type is illustrated in 
Fig. 8.2. The area Ag is first determined. Then 


T+L=-% (8.6) 


The area A, under the step response up to time T + L is then determined, and 
T is then given by 4 

T= A (8.7) 
where e is the base of the natural logarithm. The essential drawbacks of the 


method are that it may be difficult to know the size of the step in the control 


L+T 


Figure 8.2 Area method for determining L and T. 
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signal and to determine whether a steady state has been reached. The step 
should be so large that the response is clearly noticeable above the notse but 
not so large that production is disturbed. Disturbances will also influence the 
result significantly. 


On-line Refinement 


If a reasonable regulator tuning is obtained, the damping and natural fre- 
quency of the closed-loop system can also be determined from a closed-loop 
transient response. The regulator tuning can then be improved. 


8.5 METHODS BASED ON RELAY FEEDBACK 


The main drawback of the transient response method is that it is sensitive 
to disturbances because it relies on open-loop experiments. The relay-based 
methods avoid this difficulty because the required experiments are performed 
in closed loop. 


The Key Idea 


The basic idea is the observation that many processes have limit cycle oscil- 
lations under relay feedback. A block diagram of such a system is shown in 
Fig. 8.3. The input and output signals obtained when the command signal u, 
is zero are shown in Fig. 8.4. The figure shows that a limit cycle oscillation is 
established quite rapidly. We can intuitively understand what happens in the 
following way: The input to the process is a square wave with frequency ,,. By 
a Fourier series expansion we can represent the input by a sum of sinusoids 
with frequencies @,, 3@,, and so on. The output is approximately sinusoidal, 
which means that the process attenuates the higher harmonics effectively. Let 
the amplitude of the square wave be d; then the fundamental component has 
the amplitude 4d/2z. Making the approximation that all higher harmonics can 


Figure 8.3 Linear system with relay control. 
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Figure 8.4 Input and output of a system with relay feedback. 


be neglected, we find that the process output is a sinusoid with frequency @, 
and amplitude 


4d . 
a= — |G(io,)| 


To have an oscillation, the output must also go through zero when the relay 
switches. Moreover, the fundamental component of the input and the output 
must have opposite phase. We can thus conclude that the frequency w, must 
be such that the process has a phase lag of 180°. The conditions for oscillation 


are thus 

G(i@,)=-2 and Glo.) =Z =} (8.8) 

arg G(i@,) = N= aa ke - 

where K, can be regarded as the equivalent gain of the relay for transmission 
of sinusoidal signals with amplitude a. For historical reasons this parameter 
is called the ultimate gain. It is the gain that brings a system with transfer 
function G(s} to the stability boundary under pure proportional control. The 
period T, = 27 /t, is similarly called the ultimate period. An experiment with 
relay feedback is thus a convenient way to determine the ultimate period and 
the ultimate gain. Notice also that an input signal whose energy content is 
concentrated at @, is generated automatically in the experiment. 


The Ziegler-Nichols Closed-Loop Method 


Ziegler and Nichols have devised a very simple heuristical method for deter- 
mining the parameters of a PID controller based on the critical gain and the 
critical period. The controller settings are given in Table 8.2. These param- 
eters give a closed-loop system with quite low damping. Systems with better 
damping can be obtained by slight modifications of the numbers in the table. 
A modified method of this type is ideally matched to the determination of K, 
and T, by the relay method. This gives the relay auto-tuner shown in Fig. 8.5. 
When tuning is demanded, the switch is set to T, which means that relay feed- 
back is activated and the PID regulator is disconnected. When a stable limit 
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Table 8.2 Regulator parameters obtained by the Ziegler-Nichols closed-loop 


method. 
Controller K. T; Ta 
P 0.5K, 
PI 0.4K, 0.87, 
PID 0.6K, 0.57, 0.127, 


cycle is established, the PID parameters are computed, and the PID controller 
is then connected to the process. Naturally, the method will not work for all 
systems. First, there will not be unique limit cycle oscillations for an arbitrary 
transfer function. Second, PID control is not appropriate for all processes. Re- 
lay auto-tuning has empirically been found to work well for a large class of 
systems encountered in process control. 


The Method of Describing Function 


The approximative method used to derive the conditions for relay oscillations 
given by Eqs. (8.8) is called the method of harmonic balance. We will now 
describe a slight variation of the method that can be used to obtain additional 
insight. This is called the describing function method. It can be described as 
follows: Consider a simple feedback system composed of a linear part with the 
transfer function G(s) and feedback with an ideal relay as shown in Fig. 8.3. 
The conditions for limit cycle oscillations can be determined approximately 
by investigating the propagation of sinusoidal signals around the loop. There 
will be higher harmonics because of the relay, but they will be neglected. 
The propagation of a sine wave through the linear system is described by 
the complex number G(iw). Similarly, the propagation of a sine wave through 


Figure 8.5 Block diagram of a relay auto-tuner. 
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Figure 8.6 Nyquist curve G(iw) and the describing function N (a) for a 
relay. 


the nonlinearity can also be characterized by a complex number N(a), which 
depends on the amplitude of the signal at the input of the nonlinearity. N{a) is 
called the describing function of the nonlinearity. The condition for oscillation 
is then that the signal comes back with the same amplitude and phase as it 
passes the closed loop. This gives the condition 


G(im)N(a) = —1 
This condition can be represented graphically by also plotting the curve N(a) 
in the Nyquist diagram. (See Fig. 8.6.} For the relay the nonlinearity is 
4d 
N(a) = — 
(a) = 2 


because a is the input signal amplitude and the fundamental component of the 
output has amplitude 4d/z. A possible oscillation is at the intersection of the 
curves, The frequency is read from the Nyquist curve and the amplitude from 
the describing function. 


© EXAMPLE 81 Relay oscillation 

Consider a system with relay feedback as in Fig. 8.3 with 
= Ka 

~ s(s+1)(s +a) 


K = 5, a = 10, d = 1, and te = 0. This was the system used to generate 
Fig. 8.4. Simple calculations show that 


G(s) 


1 Du 


arg G(io,) = -5 — tan`! @, — tan” = 
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This implies that the Nyquist curve intersects the negative real axis for @, = 
væ. The approximative analysis thus gives the following estimate of the period: 


i po I ce 
"— Ja va 


Using Egs. (8.8) gives a = 4d|G(iw,,)|/7 = 0.58. From the simulations it can 
be determined that the true values are T, = 2.07 and a = 0.62, which show 
that the describing function method gives fair but not very accurate estimates 
in this example. m 


= 1.99 


Several refinements of the method are useful. The amplitude of the limit 
cycle oscillation can be specified by introducing a feedback that adjusts the 
relay amplitude. A hysteresis in the relay is useful to make the system less 
sensitive to noise. The parameters T, and K, can be used to determine the 
parameters of a PID regulator. The method can be made insensitive to distur- 
bances by comparing and averaging over several periods of the oscillation. 


EXAMPLES8.2 Auto-tuning of cascaded tanks 


The properties of a relay auto-tuner are illustrated by an example. The process 
to be controlled consists of three cascaded tanks. The level of the lower tank is 
measured, and the control variable is the voltage to the amplifier driving the 
pump for the inlet. The signals are noisy. The relay in the auto-tuner has a 
hysteresis, which is determined automatically on the basis of measurements of 
the process disturbances. The relay amplitude is also adjusted automatically 
to keep a specified amplitude of the limit cycle. The limit cycle is judged to 
be stationary by measuring the periods and amplitudes of two positive half- 
periods. Figure 8.7 shows the process inputs and outputs in one experiment, 
illustrating the effect of amplitude adjustment. When the tuning is finished, 


PID control 


0 
0 100 200 s 


Figure 8.7 Results obtained by applying an auto-tuner to level control of 
three cascaded tanks. 


8.6 Relay Oscillations 385 


the regulator is switched to PID control automatically. A change of the setpoint 
shows that the tuning has been successful. O 


Improved Estimates and Pre-tuning 


So far, only two parameters, K, and T, have been extracted from the relay 
experiment. Much more information can be obtained. By changing the setpoint 
during the relay experiment it is possible to determine the static process gain k. 
The product kK, can then be used to assess the appropriateness of PID control 
with Ziegler-Nichols tuning. A common rule is that the Ziegler-Nichols method 
can be used if 2 < kK, < 20. Values that are lower than 2 indicate that a control 
law that admits dead-time compensation should be used. Large values of kK,, 
indicate that improved performance can be obtained with a more complex 
control algorithm. The relay experiment can also be used to estimate a discrete- 
time transfer function by using standard system identification methods. 

The relay method is ideally suited as a pre-tuner for a more sophisticated 
adaptive controller. A model such as Eq. (8.4) is very useful to select the sam- 
pling period and achievable closed-loop response for an MRAS or an STR. It 
provides a PID controller that can serve as a backup controller. If the static 
gain is also determined, the quantity kK, can be used to assess the process 
dynamics. The ultimate period can be used to obtain an estimate of an appro- 
priate sampling period. Parameter estimates that can serve as initial values 
in the recursive parameter estimator can he obtained by applying a parameter 
estimation method to the data from the relay experiments. If an adaptive con- 
troller based on a pole placement design is used, the ultimate period can also 
be used to find appropriate values of desired closed-loop bandwidths. 


8.6 RELAY OSCILLATIONS 


Since limit cycling under relay feedback is a key idea of relay auto-tuning, it is 
important to understand why a linear system oscillates under relay feedback 
and when the oscillation is stable. It is also important to have methods for 
determining the period and the amplitude of the oscillations. Consider the 
system shown in Fig. 8.3. Introduce the following state space realization of the 
transfer function G(s): 


dx 
y= Cx 
The relay can be described by 
d ife>O 
= 8.10 
4 (a ife<0 (8.10) 


where e = u, — y. We have the following result. 
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THEOREM 8.1 Limit cycle period 
Assume that the system defined in Fig. 8.3 and by Eqs. (8.9) and (8.10) has a 


symmetric limit cycle with period T. The period T is then the smallest value 
of T > 0 that satisfies the equation 


C(I +) rT =0 (8.11) 
where 
P = AT? 
and 
T/2 
r= f e“ ds B 
if] 


Proof: Let t, denote the times when the relay switches. Since the limit cycle 
is symmetric, it follows that 


tz41 —t, = T/2 


Assume that the control signal u is d over the interval (t,, t,41). Integration 
of Eqs. (8.9) over the interval gives 


x(t) = x(t) +Td 
Since the limit cycle is symmetric, it also follows that 
x(te+1) = —x(tz) 


Hence 
x(t.) = -U + &) "Td 


Since the output y(t) must be zero at tz, it follows that 
y(t) = Cx(t,) = —C(I + &) "Td = 0 
which gives Eq. (8.11). oO 
Remark 1, The condition of Eq. (8.11) can also be written as 
Hy j2(—1) = 0 (8.12) 


where H7/2(z) is the pulse transfer function obtained when sampling the 
system of Eqs. (8.9) with period 7/2. 

Remark 2. The result that the period is given by Eg. (8.12) also holds for 
linear systems with a time delay, provided that T /2 is larger than or equal to 
the delay. 


Remark 3. Similar conditions can also be derived for relays with hysteresis. 
Oo 
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Comparison with the Describing Function 


Having obtained the exact formula of Eq. (8.11) for T, it is possible to in- 
vestigate the precision of the describing function approximation. Consider the 
symmetric case and introduce h = T/2. The pulse transfer function obtained 
in sampling the system of Eqs. (8.9) with period h is given by 


1< 1 
hy L _ phfa+ing,) ; 
Hale) 3 5 s+ ind, € g ) G(s + ing,) 


where @, = 2m /h. Put sh = iz: 


= 2 Œ% + 2NA 
il) = > aria h) 


—oo 
oo 


Lesa (6(*))=9 


0 


The first term of the series gives 


4 I 4. „27 
H,(—1) = —Im (G g3) = Im G (3) =0 
which is the same result for calculation of T obtained from the describing 
function analysis. This implies that the describing function approximation is 
accurate only if G(s) has low-pass character. An example illustrates determi- 
nation of the period of oscillation. 


EXAMPLE 83 Limit cycle period 


Consider the same process as in Example 8.1. To apply Theorem 8.1, the system 
G(s) is sampled with period k. The pulse transfer function is 


Kh Ka(i—e~*) K (1— eh) 


H;(2) = {z— 1) 7 (a —1)(z-—e*) t a{a —1)(z— e+) 
Hence 
Kh Ka(1—e™) K(1 — e-**) 
eg (a —1)(1+e*)  a(a —1)(14+ e*) 
_ Kh r Ka {l~e L 1T—e = 
~ 2 "a@a-l\ite* lite] 
Numerical search for the value of h that satisfies this equation gives k = 1.035. 
This gives T, = 2.07, which agrees with the simulation in Fig. 8.4. o 


Stable periodic solutions will not be obtained for all systems. A double 
integrator under pure relay control, for example, will give periodic solutions 
with an arbitrary period. 
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8.7 CONCLUSIONS 


In this chapter we have described simple robust methods that can be used 
to get crude estimates of process dynamics. The methods can be used for 
automatic tuning of simple regulators of the PID type or as pre-tuners for 
more sophisticated adaptive control algorithms. Two types of methods have 
been discussed: a transient method based on open-loop step tests and a closed- 
loop method based on relay feedback. 


PROBLEMS 
8.1 Consider a process characterized by the transfer function 
— k =L 
G(s) = l+sT° 
Show that parameters T and L are exactly given by Eqs. (8.6) and (8.7). 
8.2 Consider a process with the transfer function 


G(s) = I (1+sTs) any © 


Show that Eq. (8.6) gives 


T+L=S°T, +L 
k=1 


8.3 Consider a process described by the transfer function 
G(s) = ek 


Determine a proportional regulator that gives an amplitude margin Am = 
2. Show that it is identical to the setting obtained by applying the Ziegler- 
Nichols rule in Table 8.2. 


8.4 Determine the period of the limit cycle obtained when processes with 
transfer functions 


(a) Gs O GS) = Gap +i (c) G(s)=4 


are provided with relay feedback. Use both the approximate and exact 
methods. 


8.5 Consider a process with the transfer function given in Problem 8.3. Deter- 
mine a proportional regulator obtained with the Ziegler-Nichols method 
given in Table 8.2. 
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CHAPTER 9 


GAIN SCHEDULING 


9.1 INTRODUCTION 


In many situations it is known how the dynamics of a process change with the 
operating conditions of the process. One source for the change in dynamics may 
be nonlinearities that are known. It is then possible to change the parameters 
of the controller by monitoring the operating conditions of the process. This idea 
is called gain scheduling, since the scheme was originally used to accommodate 
changes in process gain only. Gain scheduling is a nonlinear feedback of special 
type; it has a linear controller whose parameters are changed as a function 
of operating conditions in a preprogrammed way. The idea of relating the 
controller parameters to auxiliary variables is old, but the hardware needed 
to implement it easily was not available until recently. To implement gain 
scheduling with analog techniques, it is necessary to have function generators 
and multipliers. Such components have been quite expensive to design and 
operate. Gain scheduling has thus been used only in special cases, such as in 
autopilots for high-performance aircraft. Gain scheduling is easy to implement 
in computer-controlled systems, provided that there is support in the available 
software. 

Gain scheduling based on measurements of operating conditions of the 
process is often a good way to compensate for variations in process parameters 
or known nonlinearities of the process. It is controversial whether a system 
with gain scheduling should be considered an adaptive system or not, because 
the parameters are changed in an open-loop or preprogrammed fashion. If we 
use the informal definition of adaptive controllers given in Section 1.1, gain 
scheduling can be regarded as an adaptive controller. Gain scheduling is a 
very useful technique for reducing the effects of parameter variations. In fact 
it is the foremost method for handling parameter variations in flight control 
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systems. There are also many commercial process control systems in which gain 
scheduling can be used to compensate for static and dynamic nonlinearities. 
Split-range controllers that use different sets of parameters for different, ranges 
of the process output can be regarded as a special type of gain-scheduling 
controllers. 

Section 9.2 gives the principle of gain scheduling. Different ways te de- 
sign systems with gain scheduling are treated in Section 9.3, and Section 9.4 
gives a method based on nonlinear transformations. Section 9.5 describes some 
applications of gain scheduling. Conclusions are given in Section 9.6. 


9,2 THE PRINCIPLE 


It is sometimes possible to find auxiliary variables that correlate well with 
the changes in process dynamics. It is then possible to reduce the effects of 
parameter variations simply by changing the parameters of the controller as 
functions of the auxiliary variables (see Fig. 9.1). Gain scheduling can thus be 
viewed as a feedback control system in which the feedback gains are adjusted 
by using feedforward compensation. The concept of gain scheduling originated 
in connection with the development of flight control systems. In this application 
the Mach number and the dynamic pressure are measured by air data sensors 
and used as scheduling variables. 

A main problem in the design of systems with gain scheduling is to find 
suitable scheduling variables. This is normally done on the basis of knowledge 
of the physics of a system. In process control the production rate can often be 
chosen as a scheduling variable, since time constants and time delays are often 
inversely proportional to production rate. (Compare Example 1.5.) 


Gain 
schedule 


Controller 
parameters 


Operating 
condition 
Command 
signal 


Controller Output 


Figure 9.1 Block diagram of a system in which influences of parameter 
variations are reduced by gain scheduling. 
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When scheduling variables have been determined, the controller parame- 
ters are calculated at a number of operating conditions by using some suitable 
design method. The controller is thus tuned or calibrated for each operating 
condition. The stability and performance of the system are typically evaluated 
by simulation; particular attention is given to the transition between differ- 
ent operating conditions. The number of entries in the scheduling tables is 
increased if necessary. Notice, however, that there is no feedback from the per- 
formance of the closed-loop system to the controller parameters. 

It is sometimes possible to obtain gain schedules by introducing nonlinear 
transformations in such a way that the transformed system does not depend 
on the operating conditions. The auxiliary measurements are used together 
with the process measurements to calculate the transformed variables. The 
transformed control variable is then calculated and retransformed before it 
is applied to the process. The controller thus obtained can be regarded as 
being composed of two nonlinear transformations with a linear controller in 
between. Sometimes the transformation is based on variables that are obtained 
indirectly through state estimation. Examples are given in Sections 9.4 and 9.5. 

One drawback of gain scheduling is that it is an open-loop compensation. 
There is no feedback to compensate for an incorrect schedule. Another draw- 
back of gain scheduling is that the design may be time-consuming. The con- 
troller parameters must be determined for many operating conditions, and the 
performance must be checked by extensive simulations. This difficulty is partly 
avoided if scheduling is based on nonlinear transformations. 

Gain scheduling has the advantage that the controller parameters can be 
changed very quickly in response to process changes. Since no estimation of 
parameters occurs, the limiting factors depend on how quickly the auxiliary 
measurements respond to process changes. 


9.3 DESIGN OF GAIN-SCHEDULING CONTROLLERS 


It is difficult to give general rules for designing gain-scheduling controllers. 
The key question is to determine the variables that can be used as scheduling 
variables. It is clear that these auxiliary signals must reflect the operating 
conditions of the plant. Ideally, there should be simple expressions for how the 
controller parameters relate to the scheduling variables. It is thus necessary 
to have good insight into the dynamics of the process if pain scheduling is to 
be used. The following general ideas can be useful: 

e Linearization of nonlinear actuators, 

« Gain scheduling based on measurements of auxiliary variables, 

e Time scaling based on production rate, and 


» Nonlinear transformations. 


The ideas are illustrated by some examples. 
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Figure 9.2 Compensation of a nonlinear actuator using an approximate 
inverse, 


EXAMFLE 9.1 Nonlinear actuator 


Consider the system with a nonlinear valve in Example 1.4. The nonlinearity 
is assumed to be 


v=f(uj=ut u20 


Let f-) be an approximation of the inverse of the valve characteristic. To 
compensate for the nonlinearity, the output of the controller is fed through 
this function before it is applied to the valve (see Fig. 9.2). This gives the 
relation 


v= fu) =f (FO) 
where c is the output of the PI controller. The function f(/-!(c)) should have 
less variation in gain than f. If f- is the exact inverse, then v = c. 


Assume that f(u) = ut is approximated by two lines (see Fig. 9.3): one 
connecting the points (0, 0) and (1.3, 3) and the other connecting (1.3, 3) and 


10 f (u) 
fie) 


0 0.5 1 15 2 


Figure 9.3 The nonlinear valve characteristic v = f(u) = u* and a two-line 
approximation f(u). 
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Figure 9.4 Simulation of the system in Example 9.1 with nonlinear valve 
and compensation using an approximation of the valve characteristic. Com- 
pare Fig. 1.9. 


(2, 16). Then 


. <c<38 
Mo = f ooge O<c< 
0.0538c + 1.139 38<c<16 


Figure 9.4 shows step changes in the reference signal at three different operat- 
ing conditions when the approximation of the inverse of the valve characteristic 
is used between the controller and the valve. (Compare with the uncompensated 
system in Fig. 1.9.) There is considerable improvement in the performance of 
the closed-loop system. By improving the inverse it is possible to make the 
process even more insensitive to the nonlinearity of the valve. a 


Example 9.1 shows a simple and very useful idea to compensate for known 
static nonlinearities. In practice it is often sufficient to approximate the non- 
linearity by a few line segments. There are several commercial single-loop 
controllers that can make this kind of compensation. DDC packages usually 
include functions that can be used to implement nonlinearities. 

The resulting controller in Example 9.1 is nonlinear and should (in its 
basic form) not be regarded as gain scheduling. In Example 9.1 there is no 
measurement of any operating condition apart from the controller output. In 
other situations the nonlinearity is determined from measurement of several 
variables. However, a gain-scheduling controller should contain measurement 
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of a variable that is related to the operating point of the process. Gain schedul- 
ing based on an auxiliary signal is illustrated in the following example. 


EXAMPLE9.2 ‘Tank system 


Consider a tank in which the cross section A varies with height h. The model 
is S 


h 
V =| A(t) dt 
0 
dV dh 
zir AVG, = Gi — ev 20h 


where V is the volume, g; is the input flow, and a is the cross section of the 
outlet pipe. Let q; be the input, and let h be the output of the system. The 
linearized model at an operating point, gf and h°, is given by the transfer 
Function 


SOs s £ a 
where 
oe 22 È _ ayf2gh° 
B= E = TAÇA 2A(h)A 


A good PI control of the tank is given by 


u(t) =K («to E: Z | e(t) dr) 


where 
26@—-—a 
K = =—— 
P 
and 
20 — æ 
E 


This gives a closed-loop system with natural frequency œ and relative damping 
¢. Introducing the expressions for œ and f gives the following gain schedule: 


K = 2f@A(h*) — a, 


p= 2b Gin 
' @ 2A(h°\h° wo? 
The numerical values are often such that œ « 2. The schedule can then be 
simplified to 
K = 2¢wA(h") 


res 
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In this case it is thus sufficient to make the gain proportional to the cross 
section of the tank. D 


Example 9.2 illustrates that it can sometimes be sufficient to measure one 
or two variables in the process and use them as inputs to the gain schedule. 
Often, itis not as easy as in Example 9.2 to determine the controller parameters 
as a function of the measured variables. The design of the controller must then 
be redone for different working points of the process. Some care must also 
be exercised if the measured signals are noisy. They may have to be filtered 
properly before they are used as scheduling variables. 

The next example illustrates that gains, delays, and time constants are 
often inversely proportional to the production rate of the process. This fact can 
be used to make time scaling. 


EXAMPLE 9.3 | Concentration control 


Consider the concentration control problem in Example 1.5. The process is 
described by Eq. (1.3). Assume that we are interested in manipulating the 
concentration in the tank, c, by changing the inlet concentration, Ci. For 4 
fixed flow the dynamics can be described by the transfer function 


1 —ET 
CO) ar 
where 
T=Vnl@ T= Va/q 


If 7 < T, then it is straightforward to determine a PI controller that performs 
well when g is constant. However, it is difficult to find universal values of 
the controller parameters that will work well for wide ranges of g. This is 
illustrated in Fig. 1.11, which shows the step responses of a fixed-gain controller 
for varying flows. Since the process has a time delay, it is natural to look 
for sampled data controllers. Sampling of the model with sampling period 
h = Va/(dq), where d is an integer, gives 


e(kh +h) = ac(kh) + (1 —a)u(kh — dh) 


where 
a= eo Ih] Vm = Vaj (Vma) 


Notice that the sampled data model has only one parameter, a, that does not 
depend on g. A constant-gain controller can easily be designed for the sampled 
data system. 

The gain scheduling is realized simply by having a controller with constant 
parameters, in which the sampling rate is inversely proportional to the flow 
rate. This will give the same response, independent of the flow, in looking at the 
sampling instants, but the transients will be scaled in time. Figure 9.5 shows 
the output concentration and the control signals for three different flows. To 
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Figure 9.5 Output concentration and control signal when the process 
in Example 9.3 is controlled by a fixed digital controller but the sampling 
interval is A = 1/(2q). (a) g = 0.5; (b) g = 1; (c) q = 2. 


implement this gain-scheduling controller, it is necessary to measure not only 
the concentration but also the flow. Errors in the flow measurement will result 
in jitter in the sampling period. To avoid this, it is necessary to filter the flow 
measurement. 

The Ziegler-Nichols transient response method discussed in Section 8.4 is 
based on a model with a time delay and a first-order system. Table 8.1 gives 


_ 09r _ 0.9V¢ 
q 


That is, the integration time is inversely proportional to the flow q. This is the 
same effect as is obtained with the discrete-time controlier when the sampling 
period is inversely proportional to q. O 


In Examples 9.1 and 9.2 it was possible to determine the schedules exactly. 
The behavior of the closed-loop system does not depend on the operating con- 
ditions. In other cases it is possible to obtain only approximate relations for 
different operating conditions. The design then has to be repeated for several 
operating conditions to create a table. It is also necessary to interpolate be- 
tween the values of the table to obtain a smooth behavior of the closed-loop 
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system. This can lead to extensive calculations and simulations before the full 
gain schedule is obtained. 

The gain schedule is usually obtained through simulations of a process 
model, but it is also possible to build up the gain table on-line. This might be 
done by using an auto-tuner or an adaptive controller. The adaptive system 
is used to get the controller parameters for different operating points. The 
parameters are then stored for later use when the system returns to the same 
or a neighboring operating point. 


94 NONLINEAR TRANSFORMATIONS 


It is of great interest to find transformations such that the transformed system 
is linear and independent of the operating conditions. The process in Exam- 
ple 9.3 is one example in which this can be done by time scaling. The obtained 
sampled model is independent of the flow because the time is scaled as 


This means that the key variable is distance traveled by a particle instead of 
time. All processes associated with material flows—rolling mills, band trans- 
porters, flows in pipes, and so on—have this property. 

A system of the form 


=a 


f (x(t) + g (x(£)) e(t) 


can also be transformed into a linear system, provided that all states of the 
system can be measured and a generalized observability condition holds. (Com- 
pare Section 5.10.) The system is first transformed into a fixed linear system. 
The transformation is usually nonlinear and depends on the states of the pro- 
cess. A controller is then designed for the transformed model, and the control 
signals of the model are retransformed into the original control signals. The 
result is a special type of nonlinear controller, which can be interpreted as a 
gain-scheduling controller. Knowledge about the nonlinearities in the model is 
built into the controller. The method with nonlinear transformations 1s illus- 
trated by an example. 


EXAMPLE 9.4 Nonlinear transformation of a pendulum 


Consider the system 
dri 
dt 
Ge sina + u Cos x1 (9.1) 


Your 
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Figure 9.6 The pendulum described by Eqs. (9.1), controlled by (a) the 
nonlinear controller of Eq. (9.3) and (b} the fixed-gain controller of Eq. (9.4). 
The desired characteristic equation (Eq. 9.2} is defined by p, = 2.8 and p, = 4. 


which describes a pendulum, where the acceleration of the pivot point is the 
input and the output y is the angle from a downward position. Introduce the 
transformed contro! signal 


v(t) = — sin x, (t) + u(t) cos x1 (t) 


This gives the linear equations 


dx 0 1 0 
a E a] x+ (3) ” 
Assume that xı and xy are measured, and introduce the control law 
v(t) = —lixı (t) — laxa (t) + m'ue (t) 
The transfer function from u, to y is 


m' 


s* + EA 
Let the desired characteristic equation be 


s° + pis + pe (9.2) 
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which can be obtained with 
l= pe l =p m = pe 
Transformation back to the original control signal gives 


u(t) = PFS AO L ig Creal) — prali) + poult) + sinz (A) 
(9.3) 

The controller is thus highly nonlinear. Figure 9.6 shows the output and the 
control signal when the controller of Eq. (9.8) is used and when a fixed-gain 
controller 

u(t) = —lyx1 (t) — lexe(t) + mue(t) (9.4) 
is used. The parameters lı, la, and m are chosen to give the characteristic 
equation (Eqs. 9.2) when the system is linearized around x, = 7, that is, the 
upright position. 

Notice that Eg. (9.3) can be used for all angles except for xı = 42/2, 
that is, when the pendulum is horizontal. The magnitude of the control signal 
increases without bounds when x; approaches + 27/2. The linearized model is 
not controllable at this operating point. D 


The following example illustrates how to use the method of nonlinear 
transformations for a second-order system. 


EXAMPLE 9.5 Nonlinear transformation of a second-order system 
Consider the system 


dx, 
ae fi(xi, x2) 


dxa _ 
ae fo(x1,%2,%) 
y=xXi 


Assume that the state variables can be measured and that we want to find a 
feedback such that the response of the variable xı to the command signal is 
given by the transfer function 


2 


@ 
Os) = Fy or as 4 a 25) 
Introduce new coordinates zı and ze, defined by 
z1 = x 
dx 
Za = ar = fi(%1, x2) 
and the new control signal v, defined by 
a a 
p= F'(x1,X2,t) = SA f+ sh (9.6) 
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These transformations result in the linear system 


dz, z 
— = 2 
dt (9.7) 
d2 _ 
dt 
It is easily seen that the linear feedback 
v= a? (te — zı) — 26 Wz (9.8) 


gives the desired closed-loop transfer function of Eq. (9.5) from u, to z = x1 
for the linear system of Egs. (9.7). It remains to transform back to the original 
variables. It follows from Eqs. (9.6) and (9.8) that 


F(x1,%2,u) = seh + Sh = (ue — x1) _ 2¢ o fi (x1, x2) 


Solving this equation for u gives the desired feedback. It follows from the 
implicit function theorem that a condition for local solvability is that the partial 
derivative F /Qu is different from zero. o 


The generalization of Example 9.5 requires a solution to the general prob- 
lem of transforming a nonlinear system into a linear system by nonlinear feed- 
back. Conditions and examples are given in the references at the end of this 
chapter. Figure 9.7 shows the general case when the full state is measured. 
There is a nonlinear transformation 


u = 91(x,v) 
z= g(x) 


that makes the relation between v and z linear. A state feedback controller from 
z is then computed that gives v. The control signal v is then transformed into 


State 
feedback 
controller 


Figure 9.7 Block diagram of a controller based on nonlinear transforma- 
tion. 
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the original control signal u. Feedback linearization requires good knowledge 
about the nonlinearities of the process. Uncertainties will give a transformed 
system that is not linear, although it may be easier to control than the original 
system. 

A simple version of the problem also occurs in control of industrial robots. 
In this case the basic equation can be written as 


where J is the moment of inertia, ọ is an angle at a joint, and 7, is a torque, 
which depends on the motor current, the torque angles, and their first two 
derivatives. The equations are thus in the desired form, and the nonlinear 
feedback is obtained by determining the currents that give the desired torque. 
The problem is therefore called the torque transformation. 


9.5 APPLICATIONS OF GAIN SCHEDULING 


Gain scheduling is a very useful method. It requires good knowledge about the 
process and that some auxiliary variables can be measured. A great advantage 
with the method is that the controller adapts quickly to changing conditions. 

This section contains examples of some cases in which it is advantageous to 
use gain scheduling in some of the forms that have been presented above. The 
examples include ship steering, pH control, combustion control, engine control, 
and flight control. 


Ship Steering 


Autopilots for ships are normally based on feedback from a heading measure- 
ment, using a gyrocompass, to a steering engine, which drives the rudder. It is 
common practice to use a control law of the PID type with fixed parameters. 
Although such a controller can be made to work reasonably well, its perfor- 
mance is poor in heavy weather and when the speed of the ship is changed. 
The reason is that the ship dynamics change with the speed of the ship and that 
the disturbances change with the weather. There is a growing awareness that 
autopilots can be improved considerably by taking these changes into account. 
This is illustrated by analysis of some simple models. 

The ship dynamics are obtained by applying Newton’s equations to the 
motion of the ship. For large ships the motion in the vertical plane can be 
separated from the other motions. It is customary to describe the horizontal 
motion by using a coordinate system fixed to the ship (see Fig. 9.8). Let V be 
the total velocity, let u and v be the x and y components of the velocity, and let 
r be the angular velocity of the ship. 
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Figure 9.8 Coordinates and notations used to describe the equations of 
motion of ships. 


In normal steering, the ship makes small deviations from a straight-line 
course. It is thus natural to linearize the equations of motion around the 
solution u = ug, v = 0, r = 0, and ô = 0. The natural state variables are the 
sway velocity v, the turning rate r, and the heading y. The following equations 
are obtained: 


& = (ufan + uar + (u° /l1)bi8 

T = (u/Pjanv + (u/l)azr + (u? /)b28 (9.9) 
dy _ 

dt 


where u is the constant forward velocity and / is the length of the ship. 

The parameters in the state equation (Eqs. 9.9) are surprisingly con- 
stant for different ships and different operating conditions (see Table 9.1). 
The transfer function from rudder angle ô to heading y is easily determined 
from Eqs. (9.9). The following result is obtained: 


__-K(1+sTs) 
Gls) = +e sT (9.10) 
where 
K = Koul (9.11) 


T; = Tol fu i= 1,2,8 


The parameters Ko and Tip are also given in Table 9.1. Notice that they may 
change considerably even if the parameters of the state model do not change 
much. In many cases the model can be simplified to 


G= ra (9.12) 
where 
mTOR] ann 
one (!) 
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Table 9.1 Parameters of models for different ships. 


Ship Mine- Cargo Tanker 
sweeper Full Ballast 
Length (m) 55 161 350 
Gy —0.86 —0.77 —0.45 —0.43 
Cya —0.48 —0.34 —0.43 —0.45 
tai —5.2 —8.39 —4,] —1.98 
Gos —24 —1.63 —0.81 ~—115 
by 0.18 0.17 0.10 0.14 
be —14 —1.63 —0.81 —1.15 
Ko 2.11 —3.86 0.83 5.88 
Tio —8.25 5.66 —2.88 —16.91 
Too 0.29 0.38 0.38 0.45 
Tao 0.65 0.89 1.07 1.43 
üo —0.14 0.19 —0.28 —0.06 
bo —1.4 —1.63 -0.81 —1.15 


This model is called the Nomoto model. Its gain b can be expressed approxi- 


mately as follows: 
uy? Al 
b=e(T) D (9.14) 


where D (in cubic meters) is the displacement, A {in square meters) is the 
rudder area, and c is a parameter whose value is approximately 0.5. The 
parameter a will depend on trim, speed, and loading. Its sign may change 
with the operating conditions. 

A ship is influenced by disturbances due to wind, waves, and currents. The 
effects of these can be described as additional forces. Reasonable models have 
constant, periodic, and random components. The disturbances due to waves 
are typically periodic. The period may vary with the speed of the ship and its 
orientation relative to the waves. 

The effects of parameter variations can be seen from the linearized models 
in Eqs. (9.9), (9.10), and (9.12). First, consider variations in the speed of the 
ship. It follows from Eqs. (9.11) and (9.13) that the gain is proportional to the 
square of the velocity and that the time constants are inversely proportional 
to the velocity. A reduction to half-speed thus reduces the gain to a quarter of 
its value and doubles the time constants. 

The gain is essentially determined by the ratio of the rudder forces to 
the moment of inertia. Thus the relative water velocity at the rudder is what 
determines the gain. This velocity is influenced by waves and currents. The 
relative velocity may decrease drastically when there are large waves coming 
from behind and the ship is riding on the waves. The relative velocity may be 


9.5 Applications of Gain Scheduling 405 


very small or even zero. Controllability is then lost because there is no rudder 
force. The situation is even worse if the waves are not hitting the ship straight 
from behind, because the waves will then generate torques that tend to turn 
the ship. 

The ship dynamics are also influenced by other factors. The hydrodynamic 
forces, and consequently also the parameters a;; and b; in the linearized model 
of Eqs. (9.9), depend on trim loading and water depth. This may be seen 
from Table 9.1, which gives parameters for a tanker under different loading 
conditions. Some consequences of the parameter variations are illustrated by 
an example. 


EXAMPLE 9.6 Ship steering 


Assume that the ship steering dynamics can be approximated by the Nomoto 
model of Eq. (9.12) and that a controller of PD type with the transfer function 


G,(s) = K(1 + sTa) 
is used. The loop transfer function is 


Kb(1 + sTa) 


SASS s(s+a) 


The characteristic equation of the closed-loop system is 


s* 4+ s(a+bKTz)+bK =0 


The relative damping is 


1 a 
c=; (set TaV6K) 
The damping will depend on the speed of the ship. Assume that the model 
of Eg. (9.12) has the values Gmm and bnom at the nominal speed pom. The 
variable tom is the nominal velocity used to design the feedback. Assume that 
u is the actual constant velocity. Using the speed dependence of a and b given 
by Eqs. (9.13) gives 


Uu 


a = Onom 
Unom 


This gives the damping 


= 1 nom u / 
s 2 e $ Unom ag Kran ) 


406 Chapter 9 Gain Scheduling 


Consider an unstable tanker with 


nom = —0.3 

Baom = 9.8 
K=25 
Ta = 0.86 


This gives ¢ = 0.5 and w = 1.4 at the nominal velocity. Furthermore, 


© = Ldu [unom 
¢ = —0.11 + 0.6lu /tnom 


The closed-loop characteristic frequency and damping will thus decrease with 
decreasing velocity. The closed-loop system becomes unstable when the speed 
of the ship decreases to u = 0.17 unom- 

By scaling the parameters of the autopilot according to speed, it is possible 
to obtain closed-loop performance that is less sensitive to speed variations. The 
scaling of the parameters of the controller depends on the control goal. One 
design criterion is time invariance; that is, the time response of the ship should 
always be the same. If true time invariance is desired, the controller gains 
should be inversely proportional to the square of the speed. Path invariance is 
another criterion. In this case the path on the map is always the same. The 
gains should then be inversely proportional to the velocity of the ship. The 
gains are limited at low speed to avoid large rudder motions. Oo 


pH Control 


Control of pH (the concentration of hydrogen ions} is a well-known control 
problem that presents difficulties due to large variations in process dynamics. 
The problem is similar to the simple concentration control problem in Exam- 
ple 9.3. The main difficulty arises from a static nonlinearity between pH and 
concentration. This nonlinearity depends on the substances in the solution and 
on their concentrations. 

The pH number is a measure of the concentration or, more precisely, the 
activity of hydrogen ions in a solution. It is defined by 


pH = — log [H*] (9.15) 


where [H*] denotes the concentration of hydrogen ions. The formula (9.15) is, 
strictly speaking, not correct, since [H*] has the dimension of concentration, 
which is measured in the unit M = mol/l. The correct version of Eq. (9.15) is 
thus pH = —log([H+] fH), where fy is a constant with the dimension liters 
per mole. The formula of Eq. (9.15) will be used here, however, because it is 
universally accepted in textbooks of chemistry. 

Water molecules are dissociated (split into hydrogen and hydroxy] ions) 
according to the formula 

H20 = Ht + 0H- 
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In chemical equilibrium the concentration of hydrogen H* (or rather H30*) 
and hydroxyl OH” ions are given by the formula 


HOR] 
[H20] 


Only a small fraction of the water molecules are split into ions. The water 
activity is practically unity, and we get 


H+] [OHT] = K,, (9.17) 


where the equilibrium constant K,, has the value 107 [(mol/l)”] at 25°C. The 
main nonlinearity of the pH control problem will now be discussed. 


= constant (9.16) 


EXAMPLE 9.7 Titration curve for a strong acid-base pair 


Consider neutralization of ma mol of hydrochloric acid HC] by mg mol of 
sodium hydroxide NaOH in a water solution. The following reaction takes place: 


HCl + NaOH — H* + OH + Nat + C17 
Let the total volume be V. The concentration of chloride ions is then 
[Cr] = xa = ma/V 
and the concentration of sodium ions is given by 
{Na*] = xg = mg/V 


because the acid and the base are completely ionized. Since the number of 
positive ions equals the number of negative ions, it follows that 


xa + [OE] = xg + [H*] 


The concentration of hydroxyl ions can be related to the hydrogen ion concen- 
tration by Eq. (9.17). Hence 


Ku 
[H"] 


[H*] = y 12/4 + Ku —x/2 
[OH] = 4/x2/4 + Ko + x/2 


pH = f(x) = —log (22/4 + Ku -x/2) (9.19) 


x= xp — xa = (OH7)— [H+] = 2% — EH = 10PH-" 10o- PH (9.18) 


Solving for [H*] gives 


This gives 
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f(x) 
10 
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-1-10° 5-107 0 5-10“ 110% 


Figure 9.9 Titration curve of Eq. (9.19) for neutralization of a 0.001 M 
solution of HCI with a 0.001 M solution of NaOH. 


The graph of the function f is called the titration curve. It is the fundamental 
nonlinearity for the neutralization problem. An example of the titration curve 
is shown in Fig. 9.9, which shows that there is considerable variation in the 
slope of the titration curve. The abscissa of the titration curve in Fig. 9.9 is 
given in terms of the concentration difference xg — xa. The x-axis can also be 
recalibrated into the amount of the reagent. 

The derivative of the function f is given by 


P(x) = loge _ 10 log e 
2,/x2/44+K,  10PH-14 4 10-pH 


The derivative has its largest value f’ = 2.2.1068 for pH = 7. It decreases rapidly 
for larger and smaller values of pH. For pH = 4 and 10 we have f = 4.3 - 10°. 
The gain can thus vary by several orders of magnitude. O 


Figure 9.9 shows that the pH of a strong acid that is almost neutralized 
may change very rapidly if only a small amount of base is added. 'The reason 
for this is that strong acids and bases are completely dissociated. A weak acid 
is not completely dissociated, so it can absorb hydrogen ions by converting 
them to undissociated acid. It can also create hydrogen ions by dissociating 
acid molecules. This means that a weak acid or a weak base has an ability to 
resist changes in pH. This property is called buffering. The titration curve of 
a solution that contains weak acids or bases will therefore be less steep than 
the titration curves of strong acids or bases. 

Example 9.7 shows that there will be a severe nonlinearity in the system 
due to the titration curve. An additional example illustrates the difficulties in 
controlling such a system. 


(9.20) 


EXAMPLE98 pH control 


Consider the problem of controlling the pH of an acid effluent that is fed to a 
stirred tank with volume V (in liters) and neutralized with NaOH. Let ca (in 
moles per liter} be the concentration of acid in the influent stream, and let g 


9.5 Applications of Gain Scheduling 409 


(in liters per second) be the flow of the effluent. Let cg (in moles per liter) be 
the concentration of the reagent. Assume that the reagent concentration is so 
high that the reagent flow u (in liters per second) is negligible in comparison 
with q. The system is modeled by a linear dynamic model, which describes the 
mixing dynamics as if there were no reactions, and a static nonlinear titration 
curve, which gives pH as a function of the concentrations. Let x4 and xp be the 
concentrations of acid and base in the tank if there were no chemical reactions. 
Mass balances then give 


d 
de V (9.21) 
adv? y” 


The pH is given by Eq. (9.19). It is further assumed that the dynamics of the 
pH sensor and the pump together can be described by the transfer function 


_ 1 
(1+sT)? 


Asimple calculation indicates the difficulties in the control problem. Assuming 
proportional] control with gain &, the linearized loop transfer function from the 
error in pH to pH becomes 


G(s} = 


_ crk f' 
Gols) = rs sT) FST) 
where Tm is the mixing time constant 
= Vig 


and f’ is the slope of the titration curve given by Eq. (9.20). The critical gain 
for stability is 


2q 
= T T feat 


where the approximation holds for T «< T,,. Since the slope of the titration 
curve varies drastically with pH, the critical gain will vary accordingly. Some 
values for different values of the pH of the mixture are: 


pH Critical gain 


——— (2+T7/T,)(1+ T/T.) S ——— 


€ 


7 0.009 

8 0.046 

9 0.46 
10 4.6 


To make sure that the closed-loop system is stable for small perturbations 
around an equilibrium of pH = 7, the gain should thus be less than 0.009. A 
reasonable value of the gain for operation at pH = 8 is k = 0.01, but this gain 
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0 2 4 6 8 10 
Time 
Figure 9.10 Output pH and control signal when the process in Example 9.8 
is controlled by using a PI controller when pH,,, is (a) 7; (b) 8; (c) 9. 


will give an unstable system at pH = 7 and is too low for a reasonable response 
at pH = 9. Figure 9.10 shows PI control with gain 0.01 and reset time 1. The 
process is started at equilibrium pH = 4. The reference value is then changed 
to 7, 8, and 9. 

The calculations and the simulation illustrate the key problems with pH 
control. The difficulties are compounded by the presence of time delays and 
flow variations. One way to get around the problem is to use the concentration 
x as the output rather than pH. Figure 9.11 shows a possible control scheme 
in which the measured pH and the reference value of pH are transformed into 
equivalent concentrations. This means that the variable x is computed for the 


Figure 9.11 Control configuration for the pH control problem in Exam- 
ple 9.8. 
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0 2 4 6 8 10 
Time 


Figure 9.12 The same experiment as in Fig. 9.10, but with the controller 
structure in Fig. 9.11. The gain of the controller is 1000, and the reset time 


is 1. (a) pHs = 7; (b) pH,es = 8; (c) pH. = 9. 


measured pH by the formula 


x = f-(pH) = 10PH-14 — 19-PH (9.22) 
The transfer function from u to x is 
EB 
g(1+sT7,,)(1+sT) 


which is independent of the operating point. Figure 9.12 shows the same ex- 
periments as in Fig. 9.10, but with the control modification shown in Fig. 9.11. 
It should be noted that the nonlinear compensation with Eq. (9,22) can be 
used, since a strong acid-base pair is controlled. The more general problem 
of mixtures of many weak acids and bases does not have an easy linearizing 
transformation. It is then necessary to measure the concentrations of the com- 
ponents or to make an on-line measurement of the titration curve. Some other 
form of adaptation can then be reasonable. oO 


Combustion Control 


In combustion control of a boiler it is important to adjust the oxygen content 
of the flue gases. The flow of combustion air depends on the burn rate in the 
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Trim update 


Trim 
position 


Figure 9.13 Adaptive feedforward and gain scheduling in an oxygen trim 
controller. 


boiler. The measurement signal is the oxygen content in the exhaust stack, and 
the control signal is the trim position, which controls the flow of combustion 
air. There is a significant time delay between the input to the burner and 
the oxygen sensor in the exhaust stack. With a conventional controller there 
is then a loss of efficiency before the correct trim position is reached after 
a change in the burn rate. One configuration based on adaptive feedforward 
and gain scheduling is shown in Fig. 9.13. The working range of the boiler is 
divided into regions. For each region there is a memory (digital integrator). 
All integrators are zero initially. When the boiler starts to operate, the trim 
control will adjust the oxygen setpoint. When the setpoint level is achieved, the 
appropriate integrator is set to the correct trim position. A trim profile will 
be built up as the boiler works over its range. When the boiler returns to a 
position at which the integrator is set, the stored trim value is instantly fed 
to the trim drive actuator, thus eliminating the lag from the control loop. If 
the fuel changes, the trim profile is updated automatically. The controller thus 
works with an adaptive feedforward compensation from the burn rate. There 
is also a gain scheduling of the loop gain of the controller to get tight control 
under all firing conditions. This gain schedule is built up in commissioning the 
controller. 


Fuel-Air Control in a Car Engine 


A schematic drawing of a microcomputer contro] system for a car engine is 
shown in Fig. 9.14. The accelerator is connected to the throttle valve. The fuel 
injection is governed by a table lookup controller. The control variable, which 
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Accelerator 


Engine speed Exhaust pipe 


Figure 9.14 Schematic diagram of a microcomputer engine control system. 


is the opening time for the fuel injection valve, is controlled by a combination 
of feedforward and feedback. The feedforward signal is a nonlinear function of 
engine speed and load. The load is represented by the air flow, which can be 
measured by using a hot wire anemometer, In one common system the table 
has 16 x 16 entries with linear interpolation. There is also feedback in the 
system from an exhaust oxygen sensor. The fuel-air ratio is measured by using 
a zirconium oxide catalytic sensor called the lambda sond. This sensor gives 


Output voltage V 


0.5 1.0 1.5 
Fuel-air ratio A 


Figure 9.15 The characteristic of a lambda sond, 
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an output that changes drastically when the fuel-air ratio is 1. A typical sensor 
characteristic is shown in Fig. 9.15. The lambda sond is positioned after the 
exhaust manifold in an excess oxygen environment, where the exhaust gas from 
all the cylinders is mixed. This creates a delay in the feedback loop. Notice the 
feedforward path via the table discussed earlier in the paragraph. The feedback 
has a special form; continuous control cannot be used because of the strongly 
nonlinear characteristics of the lambda sond. The error signal is formed by 
normalizing the output of the lambda sensor as follows: 


e-f 1 if V >0.5 
=] if V <05 


The error signal is thus positive if the fuel-air ratio is low (lean mixture) and 
negative when the ratio is high (rich mixture}. The error signal is sent to a PI 
controller whose gain and integration time are set from the scheduling table. 
The values are set on the basis of load (air flow) and engine speed. The gain 


Pitch stick 


Position 


Acceleration 


H M Vas 


Figure 9.16 Simplified block diagram of the pitch control of the autopilot for 
a supersonic aircraft. The highlighted blocks show the parts of the autopilot 
where gain scheduling is used. 
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schedule is implemented simply by adding entries for the gain and integration 
time to the table used for feedforward of the nominal control variable. Because 
of the relay characteristic, there will be an oscillation in the fuel-air ratio. 
This is beneficial, because the catalytic sensor needs a variation to operate 
properly. The amplitude and the frequency of the oscillation are determined by 
the parameters of the controller. 


Flight Control Systems 


Figure 9.16 shows a block diagram of the pitch channel of a flight control 
system for a supersonic aircraft. The pich stick signal is the command signal 
from the pilot. Position, acceleration, and pitch rate are feedback signals. There 
are three scheduling variables: height H, indicated airspeed Vrag, and Mach 
number M. The parameters of the controller that are scheduled are drawn as 
boxes; the arrows indicate the scheduling variables. The schedule for the gain 
Kap is given by 


Kop = Kops + (Kenn — Kobs) MF 


where Kop,,, is a function of indicated airspeed Vras (shown in Fig, 9.17) and 
Kep, is a function of height (also shown in Fig. 9.17). The variable MF is 
given by 

1 


MP= 7I 


Kur 


where Kyr is a function of the Mach number and s is the Laplace transform 
variable. 


KoD ps Kopy 
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Figure 9.17 Scheduling functions. The function Kgp,,, is also different for 
different flight modes. 


416 Chapter 9 Gain Scheduling 


9.6 CONCLUSIONS 


Gain scheduling 1s a good way to compensate for known nonlinearities. With 
such a scheme the controller reacts quickly to changing conditions. One draw- 
back of the method is that the design may be time-consuming if it is not possible 
to use nonlinear transformations or auto-tuning. Another drawback is that the 
controller parameters are changed in open loop, without feedback from the per- 
formance of the closed-loop system. This makes the method impossible to use 
if the dynamics of the process or the disturbances are not known accurately 
enough. 

Example 9.3 and the ship steering example in Section 9.5 show that it 
is often useful to introduce normalized variables. The processes then become 
constant in the new variables, and the gain scheduling of the controllers is 
easily derived. 


PROBLEMS 


9.1 Simulate the tank system in Example 9.2. Let the tank area vary as 
A(h) = Ap +h? 


Further assume that a = 0.1A9. 


(a) Study the behavior of the closed-loop system when the full gain sched- 
ule is used and when the modified gain schedule is used. 


(b) Study the sensitivity of the system to changes in the parameters of 
the process. 


(c) Study the sensitivity of the closed-loop system to noise in the mea- 
surement of the level. 


9.2 Consider the concentration control problem in Example 9.3. Design a 
fixed sampled-data controller with a fixed sampling period for the system. 
Compare it with a controller based on the time-scaled model. 


9.3 A model of a ship is given in Section 9.5. Show that the two scalings 
suggested in Example 9.6 correspond to the time invariance and the path 
invariance behavior of the ship. 


9.4 The simulations in Example 9.8 are done by using the model of Eqs. (9.21) 
and (9.19) with q = 1000, V = 1000, T = 0.1, and K,, = 10714. The 
controller is a PI controller with gain 0.01 and reset time 1. Verify the 
simulations in Fig. 9.10 and Fig. 9.12. 


9.6 Consider the ship steering problem in Example 9.6. Simulate the closed- 
loop system, and determine the sensitivity with respect to the speed of 
the ship. 
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9.6 The controller in Example 9.3 gives a control that is equal when measured 
in terms of the number of sampling intervals but not when measured 
in terms of time. Suggest and test possibilities to get the same time 
responses independent of the flow through the tank. 
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CHAPTER 10 


ROBUST AND 
SELF-OSCILLATING SYSTEMS 


10.1 WHY NOT ADAPTIVE CONTROL? 


In previous chapters we showed that adaptive control can be very useful and 
can give good closed-loop performance. However, that does not mean that adap- 
tive control is the universal tool that should always be used. A control engineer 
should be equipped with a variety of tools and the knowledge of how to use 
them. A good guideline is to use the simplest control algorithm that satisfies the 
specifications. Robust high-gain control should definitely be considered as al- 
ternatives to adaptive control algorithms. Section 10.2 treats robust high-gain 
control. The self-oscillating adaptive system (SOAS) is presented in Section 
10.3. This is a special class of adaptive systems with strong ties to high-gain 
control and auto-tuning. Relay feedback is a key ingredient of the SOAS. An- 
other class of switching systems, variable-structure systems, is discussed in 
Section 10.4. Variable-structure systems have been developed mainly in the 
Soviet Union and can be regarded as a generalization of the SOAS. Conclu- 
sions are given in Section 10.5. 


10.2 ROBUST HIGH-GAIN FEEDBACK CONTROL 


Some design methods deal explicitly with process uncertainties. One powerful 
method has been developed by Horowitz. This procedure, which has its origin 
in Bode’s classical work on feedback amplifiers, is based on several ideas. The 
specifications are expressed in terms of the transfer function from command 
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Figure 10.1 A two-degree-of-freedom system. 


signal to process output. The plant is characterized by its nominal transfer 
function. For each frequency it is also assumed that the process uncertainty is 
known in terms of variations in amplitude and phase. A solution is determined 
in terms of a controller with a feedback Gp, and a feedforward Gyr, as shown in 
Fig. 10.1. Such a configuration is called a two-degree-of-freedom system because 
there are two transfer functions to be determined. 

Several other design methods can be used to design robust controllers. One 
technique is based on LQG design. By adjusting the weighting matrices in the 
LQG problem, a loop transfer recovery (LTR) is achieved. This design procedure 
can cope with phase uncertainty at high frequencies. The key idea is to keep 
the loop gain less than 1 at high frequencies, where the phase error is large. 

In Horowitz’s procedure the feedback transfer function Grp is first deter- 
mined such that the closed-loop uncertainty is within the specified limits. The 
nominal value of the transfer function is then modified by the feedforward 
compensation Gy. The method is based on graphical constructions using the 
Nichols chart. It gives a high-order linear compensator that can cope with the 
specified plant uncertainty. The procedure attempts to keep the loop gain as 
low as possible. A key idea in the Horowitz design method is the observation 
that a system in which the Nyquist curve is close to a straight line through 
the origin can tolerate a significant change of gain. The response time will 
change with the gain, but the shape of the response will remain invariant. For 
minimum-phase systems with a known pole excess it is always possible to find 
a frequency range in which the phase is constant. By proper compensation it 
is then possible to obtain a loop gain at which the Nyquist curve is close to a 
straight-line segment. The assumption that the pole excess is known implies 
that the phase of the system is known for high frequencies. This is not always 
a realistic assumption. The Horowitz design method was originally developed 
for structured perturbations but has also been extended to unstructured un- 
certainties. 

The main step in the procedure is to determine the tolerances for the gain 
in the closed-loop transfer function. The plant uncertainties are specified as 
gain and phase variations of the plant transfer function at different frequen- 
cies. The given tolerances and uncertainties are used to calculate constraints 
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for the open-loop transfer function. The feedback compensator Gy, is then de- 
signed such that the compensated open-loop system satisfies the tolerances. 
This is usually an iterative procedure, which can conveniently be done graph- 
ically by using a Nichols chart. Finally, the prefilter Gyr is designed such that 
the closed-loop specifications are fulfilled. This may be done by using the Bode 
diagram. l 

The major drawback of the method is that it is impossible to know a priori 
whether the desired closed-loop specifications are attainable. It is thus a trial- 
and-error method, but the iterations give the designer insight into the tradeoffs 
between different specifications, such as closed-loop sensitivity, complexity of 
the controller, and measurement noise amplification. 


EXAMPLE 10.1 An industrial robot arm 


A simple model of a robot arm is used in this example. The transfer 
function from the control input (motor current J) to measurement output 
(motor angular velocity @) is 


km (Jas? + ds +k) 
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Figure 10.2 Bode plots for the robot arm in Example 10.1 for J, = 0.0002 
and J, = 0.002. 
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with J, € [0.0002,0.002], Jm = 0.002, d = 0.0001, k = 100, and km = 
0.5. The moment of inertia J, of the robot arm varies with the arm angle. 
Bode plots of the plant gain for the extreme values of the arm inertia J, are 
given in Fig. 10.2. The purpose of the control system is to control the angular 
velocity step responses at various arm angles. The aim is to get a closed-loop 
system with a bandwidth between 15 and 40 Hz. The disturbance rejection 
specification has been set to 6 dB. A feedback compensator that satisfies the 
specifications is 

125(1 + s/50)(1 + s/300) 


Gres) = 5 s/800)(1 + 5/8000) 


This compensator is essentially a PI controller with a lead filter. The final 
prefilter has the transfer function 


1+ s/1000 


Grrl) = F s7200 + s/200)(4 + 5/200) 


Simulated responses are shown in Figs. 10.3 and 10.4. 

To make a comparison, an adaptive controller is also designed for the pro- 
cess. In this particular problem the essential uncertainty is in one parameter 
only, the moment of inertia. It is then natural to try to make a special adaptive 
design in which only this parameter is estimated, 
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Figure 10.3 Simulation of the step response with the arm inertia J, = 
0.0002 for the robust system. 
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Figure 10.4 Simulation of the step response with the arm inertia J, = 0.002 
for the robust system. 


The adaptive controller is designed on the basis of a simplified model. If 
we neglect the elasticity in the robot arm, the system can be described by 


J = kml (10.1) 


where J = J, + dm is the total moment of inertia and km the current gain 
of the motor. The plant of Eq. (10.1) can be controlled adequately with a PI 
controller. The controller parameters can be chosen to be 


K= Boo] 
km 
n žo 
ao 


This gives the following characteristic equation for the closed-loop system: 
s” + 2o08 + w% = 0 


The controller parameters are thus related to the model by simple equations. 
Notice that the integration time 7; does not depend on the moment of inertia 
of the robot arm and that the controller gain K should be proportional to the 
moment of inertia. 
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A root-locus calculation indicates that the design based on the simplified 
model will work well if 


kd m \ Ut? 
#) 


a 


09 < Wis =o ( 


The most critical case occurs for J, = 0.002. It implies that wo must be less 
than 200 rad/s. 

The fact that the design is based on a simplified model limits the closed- 
loop bandwidth. A fast response to command signals can still be obtained by 
use of feedforward compensation. For this purpose, let the desired response to 
angular velocity commands be given by 
Om 


Gm (s) = i 
(s) S? + 20 00ms + OÈ, 


The feedforward controller can now be designed such that the closed-loop 
system gets the desired response. 

An adaptive system can be obtained simply by estimating the total moment 
of inertia by applying recursive least squares to the model of Eq. (10.1) and 
feeding the estimate into the above design equation. To estimate the param- 
eters of the continuous-time model of Eq. (10.1), it is necessary to introduce 
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Figure 10.6 Simulation of the tailored adaptive systems response with the 
arm inertia J, = 0.0002. The controller is initially tuned for J, = 0.002. 
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Figure 10.6 Simulation of the tailored adaptive systems response with the 
arm inertia J, = 0.002. The controller is initially tuned for J, = 0.0002. 


filtering. This is done by integrating Eq. (10.1) over the time interval (t, t+h): 
k tth 
o(t +h) -ol = "2 f I(s) ds 
t 


A least-squares estimator of J is easily constructed from this equation. This 
estimate is then used in the PI control law. Simulations of the system are shown 
in Figs. 10.5 and 10.6. The parameter h was chosen to be 0.1 s. The figures 
show that the system adapts to a good response after two transients. Notice 
the magnitudes of the control signal for the cases of low and high inertia. 
The controller structures for the robust and adaptive cases are quite sim- 
ilar by design. The feedback part of the robust controller is essentially a PI 
controller with a lead-lag filter. The parameters are K = 2.5 and T; = 0.02. 
The lead-lag filter increases the controller gain to 6.7 at frequencies around 
500 rad/s. The feedback part of the adaptive controller is also a PI controller, 
but the parameters are adjustable. They range from K = 0.15 and 7; = 0.07 
for J, = 0.0002 to K = 1.05 and 7; = 0.07 for J, = 0.002. The feedback gain 
in the adaptive controller is thus 40 times smaller than the gain of the robust 
controller. This means that the effects of measurement noise are also much 
smaller for the adaptive controller. Both systems are designed to give the same 
response time to command signals. Notice, however, that feedforward is used 
in very different ways in the two systems. In the robust design, it is used to 
decrease the response time to command signals; in the adaptive design, it is 
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used to increase the response time. The reason is that the bandwidth of the 
closed inner loop is large in the robust design, to take care of the plant vari- 
ations, whereas the adaptive design allows a low closed-loop bandwidth, since 
the uncertainty is eliminated. The responses of the adaptive system are better 
over the full parameter range when the parameters are adapted, but it will 
take some time for the parameters to adapt. The robust controller will have a 
better response when the parameters of the process are changing rapidly from 
one constant value to another. Oo 


Comparison between Robust and Adaptive Control 


The robust design method will generally give systems that respond more 
quickly when the parameters change, but it is important that the range of 
parameter variation be known. The adaptive controller responds more slowly 
but can generally handle larger parameter variations. The adaptive controller 
will give better responses to command signals and load variations when con- 
troller parameters have converged, provided that the model structure is suffi- 
ciently correct. The controllers designed by Horowitz’s method will generally 
have high-loop gains, which make them more sensitive to noise. 


10.3 SELF-OSCILLATING ADAPTIVE SYSTEMS 


A system that is insensitive to parameter variations can be obtained by using 
a two-degree-of-freedom configuration with a high-gain feedback and a feedfor- 
ward compensator (compare Section 10.2). This section introduces an adaptive 
technique to keep the gain in the feedhack loop high by using a relay feedback. 
Relays combine the properties of high gain and inexpensive implementations. 
However, relays often introduce oscillations into the system. 

The idea of the self-oscillating adaptive system (SOAS) originated in work 
at Honeywell on adaptive flight control in the late 1950s. The inspiration 
came from work on nonlinear systems by Fliigge-Lotz at Stanford. Systems 
based on the idea were flight-tested in the F-94C, the F-101, and the X-15 
aircraft. (See Fig. 1.2.) The idea has also been applied in process control, 
but the SOAS has not found widespread use. One reason is that substantial 
modifications of the basic scheme are necessary to make the systems work well. 
A characteristic feature of the SOAS is that there is a limit cycle oscillation. The 
system thus represents a type of adaptive control in which there are intentional 
perturbations, which excite the system all the time. The SOAS is one of the 
simplest systems with this property. The SOAS is based on three useful ideas: 
model-following, automatic generation of test signals, and use of a relay with 
a dither signal as a variable gain. The key result is that the loop gain is 
automatically adjusted to give an amplitude margin Am = 2. 
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Figure 10.7 Block diagram of a self-oscillating adaptive system (SOAS). 


Principles of the SOAS 


Since we want to emphasize the ideas, we will limit the discussion to the basic 
version of the system. A block diagram of an SOAS is shown in Fig. 10.7. This is 
a two-degree-of-freedom system. There is a high-gain feedback loop around the 
process. The desired response to command signals is obtained by the reference 
model. Ideally, the high-gain loop will make the process output y follow the 
model output ym. The response of the closed-loop system will be relatively 
insensitive to the variations in process dynamics because of the high loop gain. 
The system is thus a typical model-following design. The special feature is 
that the high-gain loop is nonlinear. The high-gain loop is supplemented with 
a feedforward model and a device to change the gain of the relay. The model 
determines the closed-loop response and the gain changer limits the amplitude 
of the limit cycle oscillation. 


The High-Gain Loop 


The feedback compensator contains a lead filter G;(s) and a relay. The relay 
is motivated by the desire to have as high a gain as possible. Because of 
the relay, there will be a limit cycle, whose amplitude is kept at a tolerable 
limit by adjusting the relay amplitude by a separate feedback loop. The relay 
gives a high gain for small inputs, and the gain decreases with increasing 
input amplitude. The key difficulty in the design of an SOAS is to find a 
suitable compromise between the limit cycle amplitude and the response speed. 
A low relay amplitude gives a limit cycle with a low amplitude but also a slow 
response speed. A large relay amplitude gives a rapid response but also a large 
amplitude of the limit cycle oscillation. The relations can to some extent be 
influenced by the lead filter. 
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Properties of the Basic SOAS 


The limit cycle in a system with relay feedback was discussed in Section 8.6. 
This will now be used to analyze the self-oscillating adaptive system. Consider 
the system shown in Fig. 10.7 without the gain changer. 

The relay is used to introduce a limit cycle oscillation in the system. The 
period and the amplitude of the oscillation can be determined by the methods 
discussed in Section 8.6. When the reference signal is changed, or when there 
are disturbances, there will also be other signals in the system, which will be 
superimposed on the limit cycle oscillations. The signals that appear in the 
system will thus be of the form 


s(t) = asin at + b(t) 
where asin wt denotes the limit cycle oscillation. The key to understanding 
the SOAS is to find out how signals of this type propagate in the system. It is 
straightforward to determine the transmission of the signal through the linear 
subsystems; the signal propagation through the relay is the main difficulty. 
This analysis will be simplified considerably if it is assumed that b(t) varies 
much more slowly than sin @t. Furthermore, assume that b(t) is smaller than 


a. This should be true at least in steady state, since b(t) is the difference 
between the model output and the process output. 


The Dual-Input Describing Function 


It is assumed that b(t) varies so slowly that it can be approximated by a 
constant. The input signal to the relay is thus of the form 


u(t) = a sin @t + b 
The relay input and output are shown in Fig. 10.8. The relay output can be 
expanded in a Fourier series 
y(t) = Ng +aNa sin wt + aNa, 80 20t + --- (10.2) 


where the numbers Na and Np are given by 


2a 
_ 1 _ d(x +a) —d(x ~— 2a) + ad 
Ne = ob f y(t) dt = 2b 
0 


4ad 2ad 24 sin (È) 
= ——_ = Sın 


— 2b xb xb 


17 ad T 
Na =<, | x@sinotat = Ta f sin wt dt 
0 a 


4d 4d 
= = — 2 
= Fg etre 1 — (b/a) 
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Figure 10.8 Relay inputs and outputs. 


Small values of b/a give the approximations 
4d 
Nsw — kadad 
4” ra PYT 


Notice that 
Na œ 2Ng (10.3) 


The transmission of the constant level 6 and of the first harmonic sin œt are 
thus characterized by the equivalent gains Ng and N4. Since the linear parts 
will normally attenuate high frequencies more than low frequencies, a rea- 
sonable approximation is often obtained by considering only the constant part 
and the first harmonic. The number Ng, which describes the propagation of a 
constant signal, is called the dual-input describing function, by analogy with 
the ordinary describing function that describes the propagation of sinusoids 
through static nonlinearities. Notice that the describing function Ng depends 
on a (the amplitude of the sinusoidal oscillation). This dependence is the key 
to understanding how the SOAS works. The dual-input describing function can 
be used to characterize the transmission of slowly varying signals. A detailed 
analysis of the accuracy of the approximation is fairly complicated. Let it there- 
fore suffice to mention some rules of thumb for using the approximation. The 
ratio a/b should be greater than 3, and the ratio of the limit cycle frequency to 
the signal] frequency should also be greater than 3. It is strongly recommended 
that the analysis be supplemented by simulation. 


Main Result 


The tools for explaining how the SOAS works are now available. Consider the 
system in Fig. 10.7. From Section 8.5 the period of the limit cycle is given by 
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Eqs. (8.8) when the describing function method is used. The amplitude of the 
limit cycle at the relay input is also given by Eqs. (8.8): 


NalG(ia.)| = 1 (10.4) 


The transmission of a sinusoidal signal through a relay can thus be approxi- 
mately described by an equivalent gain, which is inversely proportional to the 
signal amplitude at the relay input. The amplitude thus automatically adjusts 
so that the loop gain is unity at the frequency @,,. 

Now consider the propagation of slowly varying signals superimposed on 
the limit cycle oscillations. The propagation of the signals through the linear 
parts of the system can be described by the transfer function G(s). If the signals 
vary slowly in comparison with the limit cycle oscillations, the propagation 
through the relay is approximately described by the dual-input describing 
function Ng. The propagation of slowly varying signals is thus approximately 
described by the loop transfer function 


Go(s) = Ns (a)G (s) 
It follows from Eqs. (10.3) and (10.4) that 


|Goliou)] = Na(a)|G(io.)| = Z NalG(io,)] = 0.5 


We thus obtain the following important result, which describes the operation 
of the SOAS. 


RESULT 10.1 Amplitude margin of the SOAS 


The SOAS automatically adjusts itself so that the response to reference signals 
is approximately described by the closed-loop transfer function 


_ _kG(s) 
Gels) = FRG 
where the gain & is such that the amplitude margin is 2. Oo 


This result explains the adaptive properties of the SOAS. The result can 
also be stated in the following way: The relay acts as a variable gain. The 
magnitude of the gain depends on the amplitude of the sinusoidal signal at the 
relay input. This gain is automatically set by the limit cycle oscillation to such 
a value that the loop gain becomes 0.5 at the frequency of the limit cycle. 

The result is illustrated by an example. 


EXAMPLE 10.2 A basic SOAS 
Assume that the linear parts are characterized by the transfer function 


K & 
oO) = Gar Gre) 
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Figure 10.9 Simulation of an SOAS applied to the system in Example 10.2. 
The dashed line shows the desired response ym. 


From Example 8.1 the period of the limit cycle is approximately given by 


Ou = Ja 
The magnitude of the transfer function at this frequency is 


Gio) = zya 
If the relay amplitude is d, it follows that the amplitude of the limit cycle 
oscillation at the relay input is approximately given by 


Kd 


OT ha 


The limit cycle amplitude is thus inversely proportional to œ. A simulation 
of the system is shown in Fig. 10.9. The feedforward transfer function is a 
second-order system with the damping 0.7 and the natural frequency 1 rad/s. 
The nominal values of the parameters are K = 3, d = 0.35, and a = 20. The 
approximate analysis gives a limit cycle with period T = 1.4 and amplitude 
0.05. The process gain is suddenly increased by a factor of 5 at £ = 25, Notice the 
rapid adaptation, However, the amplitude of the oscillation will also increase 
by a factor of 5. If the value of d is chosen such that the error would be 0.05 
for the higher value of K, then the system becomes too slow for small K. O 


432 Chapter 10 Robust and Self-oscillating Systems 


Design of an SOAS 


The self-oscillating adaptive system is a simple nonlinear feedback system that 
is capable of adapting rapidly to gain variations. The system has a continuous 
limit cycle oscillation. This is not suitable when valves or other mechanical 
parts are used as actuators. However, an SOAS may conveniently be used with 
thyristors as actuators. The presence of the limit cycle oscillation may also 
cause other inconveniences. Since the system will automatically adjust to an 
amplitude margin A,, = 2, it is also necessary that the characteristics of the 
process be such that this design principle gives suitable closed-loop properties. 
The key problem in the design of the SOAS is the compromise between the 
limit cycle amplitude and the response speed. This compromise is influenced 
by the selection of the linear compensator, G;(s), and of the relay amplitude. 
(Compare Fig. 10.7.) The design for an SOAS can be described by the following 
procedure. 


Step 1: The relay amplitude is first determined such that the desired control 
authority (tracking rate, force, speed, etc.) is obtained. This can be estimated 
by analyzing the response of the process to constant control signals. 


Step2: When the relay amplitude is specified, the desired limit cycle frequency 
can be determined from the condition 


d|G,(i@,)| = eo 


where eg is the tolerable limit cycle amplitude in the error signal and G,(s) is 
the transfer function of the process. It is necessary to check that the frequency 
obtained is reasonable. For example, the frequency @, may become so high 
that the process dynamics become uncertain. 


Step 3: The final step is to determine the transfer function Gy of the linear 
compensator such that 


arg G;(i@,) + arg G,(ia,) = —z 


A large phase lead may be necessary, but this may not be realizable because 
of noise sensitivity. 


Step 4: Check that the linear closed-loop system with the loop gain Go = 
KG;G, will work well when the gain K is adjusted so that the amplitude 
margin is 2. If this is not the case, the compensator G; must be modified. O 


Notice that it is necessary to have an estimate of the magnitude of the 
process transfer function in Steps 1 and 2. Knowledge of the phase curve of 
the process transfer function is necessary in the third step. Also notice that it 
may not be possible to resolve the compromises in all steps. It is then necessary 
to add additional loops for changing the gain. 
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Gain Changers 


External feedback loops, which adjust the relay amplitude, may be used to 
resolve the compromise between a high tracking rate and a small limit cycle 
amplitude. The so-called up-logic used in the first SOAS can be described as 


follows: ; 
d= f dı if Jel > e 
dat (di —dg)e 7 if {el < er 


The time tọ is the last time that |e] < e. The relay amplitude is thus increased 
to dı when the error exceeds a limit e. The relay amplitude then decreases 
to a lower level dz when the error is less than e. This gain changer increases 
the relay amplitude and the response rate when large reference signals are 
applied. 

Another type of gain changer has been used to control the amplitude of the 
limit cycle. The limit cycle amplitude at the process output 1s measured by a 
band-pass filter and a rectifier. The relay amplitude is then adjusted to keep 
the limit cycle amplitude constant at the process output. 


Dither Signals 


In some applications it is desirable to avoid the limit cycle. One idea that has 
been used successfully is to introduce a variable gain after the relay. The gain 
is adjusted so that the limit cycle vanishes. In the early applications it was 
difficult to implement multiplications. A trick that was used to implement the 
multiplication is illustrated in Fig. 10.10. A high-frequency triangular wave is 
added to the signal before the relay. With low-pass filtering, the average effect 


Dither signal N Low-frequency 
Dither characteristic of y 


mu “Ete. 


Figure 10.10 The principle of using a dither signal. 
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of the additive triangular signal is the same as multiplication by a constant. 
The constant is inversely proportional to the amplitude of the triangular wave. 
The triangular wave is called a dither signal. Use of a dither signal is an 
Hlustration of the idea that an oscillation may be quenched by another high- 
frequency oscillation. 


EXAMPLE 10.3 SOAS with lead network and gain changer 

The relay control in Example 10.2 gave an error amplitude of about eg = 
0.03. Assume that we want to decrease the amplitude by a factor of 3 while 
maintaining d = 0.35. This gives a new oscillation frequency @!, such that 

d |G (ia',)| = 0.01 
or ©, = 10 rad/s. To get this oscillating frequency, a lead network Gy is added 
such that 

arg Gr(i@,) + arg G,(i@),) = —z 

Figure 10.11 shows a simulation of the system in Example 10.2 with the 
compensation network 
s+a 
s+15 
As in Fig. 10.9, the gain is increased by a factor of 5 at ¢ = 25. It is seen that 
the lead network decreases the amplitude of the oscillation while maintaining 


Gr(s) = 1.2 


l u 
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Figure 10.11 Simulation of the system in Example 10.3 using an SOAS 
with a lead network. The dashed line shows the desired response Ym. 
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Figure 10.12 Simulation of the system in Example 10.3 using an SOAS 
with a lead network and a gain changer. The dashed line shows the desired 
response Ym. 


the response speed. To speed up the response, we can introduce the up-logic 
for the gain. Figure 10.12 shows a simulation in which dı = 0.5, dz = 0.1, 
and e; = 0.1. The error signal is decreased, but there is still an oscillation. 
The behavior of the closed-loop system can be sensitive to the choice of the 
parameters in the gain changer. Too large a value of d; will cause the error 
to be larger than eo, and there will be no decrease in d nor in the amplitude 
of the error. The oscillation can be quenched by adding a dither signal at the 
input of the process. oO 


The examples show how the properties of the SOAS can be changed by 
using lead filters, gain changers, and dither signals. 


Externally Excited Adaptive Systems 


A system that is closely related to the SOAS is obtained by injecting a high- 
frequency sinusoid to measure the gain of the process and to set the controller 
gain. Such a system is called an externally excited adaptive system (EEAS) and 
gives the designer more freedom than the SOAS because the frequency of the 
excitation can be chosen more easily. This system is used for track-keeping in 
compact dise players. The main source for the parameter variation is a gain 
variation in the laser diode system. 
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Summary 


The basic SOAS is simple to implement and can cope with large gain changes 
in the process. Result 10.1 shows that the SOAS will automatically adjust 
itself so that the amplitude margin is 2. However, the limit cycle in the SOAS 
is noticeable and can be disturbing. The introduction of lead network, gain 
changer, and dither can decrease the amplitude of the oscillation. The EEAS 
is a similar system in which a high-frequency signal is introduced externally. 


10.4 VARIABLE-STRUCTURE SYSTEMS 


In Section 10.2 we showed how fixed robust controllers can be obtained by 
increasing the complexity of the controller. Another way to obtain a robust 
controller is to use a special version of on-off control called a variable-structure 
system (VSS). The key idea is to apply strong control action when the system 
deviates from the desired behavior. The name “variable-structure” alludes to 
the fact that the controller structure may be changed. 


Sliding Modes 


One way to change the structure of the system is to use different controllers 
in different parts of the state space of the system. Consider the case in which 
the control law switches on the surface 


a(x) =0 
Assume that the closed-loop system is described by 
dx _ f f*(x) o(x) > 0 
dt ~ { f(x) o) <0 (10.5) 


Two situations may occur, which for the two-dimensional case are shown in 
Fig. 10.13. In Case (a) the trajectories will pass the switching curve and 
continue into the other region. However, the dynamics are different in the two 
regions. In Case (b) the vector fields will drive the state toward the surface 
o(x) = 0. The control will change rapidly from one value to another on the 
switching surface, This is called chattering. The net effect is that the state will 
move toward the surface o(x) = 0 and then slide along the surface. This is 
called sliding mode. This sliding motion can be described as follows: Let fn 
denote the projection of f on the normal of the surface o(x) = 0. Introduce a 
number @ such that 
afr t-a) =0 
The sliding motion is then given by 


of = aft + (1-a)f 
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a(x)>0 


Figure 10.13 The trajectories of Eqs. (10.5) at the switching surface. (a) 
No sliding mode; {b} sliding mode. 


This was formally shown by A. F. Filippov in the early 1960s. The control law 
giving the sliding motion is sometimes called the equivalent control law. If the 
switching is not ideal, then the trajectory will move back and forth over the 
switching surface. This is the case, for instance, when there is hysteresis in the 
switching. However, on average the motion will be along the switching surface. 


Stability and Robustness 


In a variable-structure system we attempt to find a switching surface such 
that the closed-loop system behaves as desired. We now construct a variable- 
structure controller. For this purpose we assume that the system that we want 
to control is described by the nonlinear equation 


d'y _ dy d'y dy d™ty 
So = fl 3 dee? dtr} +91 Y: dt UY gr-1 u 


If we introduce the state 


—1 9 T 
x= [= y a*y dy ) (10.6) 
dti dp? dt 
the system can be written as 

00... 0 0 fi(x) + gi(x)u 

dx 10... 0 0 0 

al: . x + 
; ` ` (10.7) 
00... 1 0 0 

= f(x) + o(x)u 
y=(0 0... O I)x 


where f(x) and g(x) are vectors. The system is nonlinear but is affine in the 
control signal. Further, it is assumed that all the states can be measured and 
that the state vector has the special form given in Eq. (10.6). For simplicity 
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it is assumed that the purpose of the control is to find a controller such that 
x = Q is an asymptotically stable solution. The problem with constant reference 
signals is considered in Problem 10.14 at the end of the chapter. 

There are three important questions that must be answered for VSS: 


e Will the trajectories starting at any point hit the switching line? 
« Is there a sliding mode? 
« Is the sliding mode stable? 


There are partial answers to these questions in the literature on VSS. For the 
special type of system defined by Eqs. (10.7) it is easy to derive a controller 
that makes the sliding mode stable. 

Let the switching surface be 


a(x) = pixi + Poke +++ + PnX%n =p x = 0 (10.8) 
Using the definition of the state vector, we find that 


a(x) = pix) + pay) +--+ pay = 0 


The dynamic behavior on the sliding surface can be specified by a proper choice 
of the numbers p;. The motion is determined by a differential equation of order 
— 1. Kt will be stable if the polynomial 


P(s) = pis”) + pos”? + +++ + Dn (10.9) 


has all its roots in the left-half plane. 
To determine a control law that keeps the system on o(x) = 0, we introduce 
the Lyapunov function V(x) = o7(x)/2. The time derivative of V is given by 
Vv 
OM = 0 (2)0(x) = x" pps 
=x" p (p f(x) + p’g(x)u(t)) 


Choose the control law 


u(t) = FE — T sign (o (x)) (10.10) 
Then dy 
i —pio(x) sign(o{x)) (10.11) 


which is negative definite, This implies that o{x) = 0 is asymptotically stable. 
Notice that there is a discontinuity in the control signal when the switching 
surface is passed. 

Assume that the system has initial values such that o(x) = fọ > 0, and 
let tg be the time when the switching surface is reached the first time. From 
Eq. (10.11) we find that 

G(x) = —y 
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Integrating this equation from 0 to te gives 


which gives tg = Oo/p. Using the same arguments for co < 0 shows that 
te = |Oo|/u. With the control law given by Eq. (10.10) the state will thus reach 
the switching surface in finite time. The subspace o(x) = 0 is asymptotically 
stable, and the state will stay on the switching surface once it is reached. The 
motion along the surface is determined by Eq. (10.9). 

Uncertainties in f and g can be handled if z is sufficiently large. Assume 
that the design of the control law is based on the approximate values f and ĝ 
instead of the true ones. Then 


dV pT( fo? -fo")p p? 
—— =g A7 O PS sigo) 
dt prô pĝ 


The right-hand side is negative if p is sufficiently large, provided that p? § and 
pg have the same sign. The system will thus be insensitive to uncertainties 
in the process model. 

One way to design a variable-structure system is to first transform the 
system to the form given by Eqs. (10.7). This is possible for controllable linear 
systems without zeros and for some classes of nonlinear systems. A stable 
switching surface in the transformed variables is then determined. The system 
and the switching criteria can then be transformed back to the original state 
variables. Notice, however, that all states must be measured. 


Smooth Control Laws 


The control law (10.10) has the drawback that the relay chatters. One way to 
avoid this is to make the relay characteristics smoother. To do this, introduce 
a boundary layer around the switching surface 


B(A = {xl lox) se} £20 


The parameter £ can be interpreted as a measure of the thickness of the bound- 
ary layer. The sign function in Eq. (10.10) is now replaced by the saturation 
function 


1 O>e 
sat(o, €) = oje —E SOLE 
+] O < AE 
The control law is then 
T 
Pf H 
u(t) = = — — sat(o (x), € 10.12 
(£) To pfo (o(x), €) (10.12) 


The width of the boundary layer will influence the tracking performance and 
the bandwidth of the closed-loop system. 
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EXAMPLE10.4 Second-order VSS 
Consider the unstable system 


dx 1 0 1 
a 3 0 = H em 
y=(0 1)x 


which has the transfer function 


1 
0) = s6=T) 


To design a variable-structure controller we determine the closed-loop dynam- 
ics by choosing the switching line 
a(x) = pix + pz%2 = 11 + x2 


Along the sliding line oc = 0 we have 


dy 
dt 


Since the system is in controllable form, the closed-loop behavior is independent 
of the system parameters at the sliding mode. The sliding mode controller 


o(x) =x + xg = +y=0 


-2 -l 0 1 2 


xı 
Figure 10.14 Phase portrait of the system in Example 10.4. The dashed line 
shows o(x) = 0. 
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Figure 10.15 The states and the output as a function of time in Exam- 
ple 10.4. The initial conditions are x, (0) = 1.5 and x,(0) = 0. The controllers 
are (a) Eq. (10.10) with u = 0.5; (b) Eq. (10.12) with x = 0.5 and £ = 0.01. 


(10.10) is now 
pt Ax . 
u(t) = -B — psign(o(x)) 
=—(2 0)x(t) — usign(o(x)) 
The phase plane of the system is shown in Fig. 10.14 when 4 = 0.5. The input 
and the output for one initial value are shown in Fig. 10.15(a). The trajectories 


hit the switching line g = 0 and stay on it. This implies that the control signal 
will chatter. Using the control law (10.12), 


u(t) =—(2 0) x(t) — usat(o(x),e) 


with € = 0.01, gives the behavior shown in Fig. 10.15(b). The control signal is 
now smooth, but the differences in the state trajectories are negligible. o 


Summary 

Variable-structure systems are related to the self-oscillating adaptive systems 
(SOAS). In variable-structure systems we want the system to get into a sliding 
mode to obtain insensitivity to parameter variations. The control signal of 
variable-structure systems will chatter in the sliding mode. The chatter can be 
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avoided by smocthing the relay characteristics. The amplitude of the control 
signal is determined by the magnitude of the state variables or the error. With 
this modification the variable-structure system can be regarded as an SOAS 
in which the relay amplitude depends on the states, The switching condition is 
a linear function of the error in the SOAS, while in variable-structure systems 
it is a nonlinear function of the states. l 

The theory on VSS can be extended to controllers, in which the feedback 
is done from a reduced number of state variables. However, the conditions 
will become more complex than those discussed in this section. There will be 
more constraining conditions on the choice of the switching plane. Since the 
conditions for the existence of a sliding mode depend on the process and the 
switching plane, there have been attempts to make adaptive VSS by adaptation 
on o(x). 

One main drawback of variable-structure systems is the problem of choos- 
ing the switching plane. It also requires measurement of all state variables. 
Another drawback is the chatter in the control signal im the sliding mode. 


10.5 CONCLUSIONS 


Robust high-gain control can be very effective for systems with structured pa- 
rameter variations, where the range of the variations is known. If the parame- 
ter bounds are uncertain, high-gain design methods will lead to a complex and 
conservative design. Relay feedback is an extreme form of high-gain systems. 
In this chapter we have described different ways to use relay feedback to obtain 
systems that are insensitive to parameter variations. Self-oscillating adaptive 
systems and variable-structure systems are two applications of this idea. The 
SOAS can be designed to work quite well, but it requires engineering effort 
and some knowledge of the process to get a satisfactory performance of the 
closed-loop system. These drawbacks have resulted in lack of interest in the 
SOAS. However, the ideas behind SOAS have become useful in connection with 
the auto-tuning of simple controllers, as discussed in Chapter 8. 


PROBLEMS 


10.1 Determine whether each of the following plants can be stabilized by a 
linear fixed-parameter compensator when a € [—1, I]: 


(a) a/s; (b) if/(st+a); (ce) 1/(1+as); (d) a/(14+s); 
(e) a/(1 — s). 


10.2 Consider the process 


G(s)=e  Te[0 1] 


10.3 


10.4 


10.5 


10.6 


10.7 
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Figure 10.16 The system in Problem 10.3. 


(a) Show that the process can be controlled by a controller of the struc- 
ture in Fig. 10.1 with 


0.6(1 + s/1.3) 
s(1 + s/2) 


1l+s 
Grr(s) = IFJ 


Gra (s) = 


(b) Simulate the behavior for changes in the command signal and step 
disturbances at the output. 


(c) Discuss how to make a self-tuning regulator based on pole placement 
for the process. 


Consider the linear closed-loop system shown in Fig. 10.16 with the same 
G(s) as in Example 10.2 and with æ = 20. Determine the gain K so that 
the amplitude margin is A,, = 2. Simulate the system and determine its 
step response. Compare this with the step response of the corresponding 
SOAS in Example 10.2. 


Consider a linear plant with the transfer function 


k 
G(s) = a4 1p 
where the gain k may vary in the range 0.1 < k < 10. Determine the 
relay amplitude d and a suitable lead network so that the limit cycle 
amplitude at the process output is less than 0.05 and the rise time to 
a step of unit amplitude is never less than 0.5. Simulate the resulting 
design and verify the results. 


Consider the system in Example 10.2. Experiment with a gain changer 
of the up-logic type. Investigate how a dither signal will influence the 
performance of the closed-loop system. 


Consider the system in Problem 10.4. Design a gain changer that keeps 
the limit cycle amplitude at 0.01 for the whole operating range. 


Consider a system with the transfer function 
k 


OS) = I 
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10.8 


10.9 


10.10 


10.11 


10.12 


10.13 


10.14 
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where the gain k may change in the range 0.1 to 10. Design a servo 
using the SOAS principie so that the closed-loop transfer function is 


1 


OS) = syst 


independent of the process gain. 


Consider the system in Example 10.4 and assume that the controller 
(10.10) is used with u = 0.5. Assume that the process is changed such 
that 

dx 

7 = axı + bu 
Determine values of a and b such that the closed-loop system is still 
asymptotically stable. 
Assume that the nth-order single-input, single-output system 


dx 
q; Att Bu 


is in companion form and that the control law is of the form 


u = 5 lix; — u sign(a(x)) 


i=l 


Derive the necessary and sufficient conditions for the existence of a 
sliding mode. When will the sliding mode be stable? 


Consider the process in Problem 1.9. Design a robust controller for the 
system, Investigate the disturbance rejection of the closed-loop system. 


Consider the process in Problem 1.10. Design a robust controller for the 
system. Investigate the disturbance rejection of the closed-loop system. 


Design an SOAS for the system in Problem 1.9, and investigate its 
properties. 


Design an SOAS for the system in Problem 1.10, and investigate its 
properties. 


Consider the process in Eqs. (10.7). Assume that the reference value is 
constant y,. The desired state is then 


xg=(O ... 0 y) 


Determine a controller such that x = xg is an asymptotically stable 
solution. (Hint: Introduce the state error = x — xz, and consider the 
Lyapunov function V(x} = o7(x)/2.) 
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CHAPTER 11 


PRACTICAL ISSUES 
AND IMPLEMENTATION 


11.1 INTRODUCTION 


The previous chapters were devoted mainly to development of algorithms and 
analysis of adaptive systems. In this chapter we discuss practical implemen- 
tation of adaptive controllers. The presentation will be guided by theoretical 
considerations, but since the issues are quite complicated, theory can cover 
only part of the problems. Several issues will therefore be solved in an ad hoc 
manner and verified by extensive experimentation and simulation. 

Since an ordinary digital controller is an integral part of an adaptive 
controller, it is essential to master implementation of conventional controllers. 
Some aspects of this are covered in Section 11.2. The discussion includes 
computational delay, sampling, prefiltering and postfiltering, and integrator 
windup. Automatic design of a controller is another important part of an 
adaptive controller. In this book we have mostly used simple design methods 
based on pole placement. In Section 11.3 we discuss how this design method 
can be modified in several ways to accommodate more complex specifications 
and to make it more robust. The design technique requires the solution of a 
Diophantine equation. Efficient numerical methods for doing this are discussed 
in Section 11.4. 

Parameter estimation is another important part of an adaptive controller. 
Implementation of estimators is discussed in Section 11.5. This includes choice 
of model structure, data filters, and excitation. The ability of an adaptive con- 
troller to track time-varying parameters is an important issue. There are sev- 
eral ways to do this. Two techniques, exponential forgetting and covariance 
resetting, are discussed in detail. It turns out that exponential forgetting in 
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combination with poor excitation can give rise to an undesirable effect called 
covariance windup. This phenomenon is discussed in detail together with sev- 
eral ways of avoiding it. The techniques discussed include constant trace algo- 
rithms, directional forgetting, and leakage. A technique to make the estimator 
less sensitive to outliers is also discussed in Section 11.5. It is important to 
have numerically efficient methods for recursive estimation. Square root al- 
gorithms, which are numerically superior to the conventional algorithms, are 
discussed in Section 11.6. 

In Section 11.7 we discuss the interaction between estimation and control. 
We show that difficulties can arise if integral action is implemented inappro- 
priately. We also show how the criteria for control and estimation can be made 
compatible by appropriate choices of the data filter and the experimental con- 
ditions. 

Some prototype algorithms are given in Section 11.8. To implement a 
control system successfully, it is necessary to consider all situations that may 
occur in practice. Conventional controllers typically have two operating modes: 
manual and automatic, Adaptive controllers have many more modes. This 
is discussed briefly in Section 11.9, which covers startup, shut-down, and 
switching between different modes. Supervision of adaptive algorithms is also 
discussed in that section. 


11.2 CONTROLLER IMPLEMENTATION 


An ordinary controller is an important part of an adaptive controller. Compare 
with the block diagram in Fig. 1.1. It is, of course, important that the controller 
be implemented in a good way. Implementation of digital controllers is treated 
in books on digital control. Some aspects are summarized in this section. 


Computational Delay 


Because the analog-to-digital (A-D) and digital-to-analog (D-A) conversions 
and the computations take time, there will always be a delay between the 
measurement and the time the control signal is applied to the process. This 
delay, which is called the computational delay, depends on how the control law 
is implemented in the computer. Two ways are illustrated in Fig. 11.1. In Case 
(a) the measured variable at time f is used to compute the control signal 
applied at time f,;:. In Case (b) the control signal is applied as soon as the 
computations are finished. The disadvantage of Case (a) is that the control 
action is delayed unnecessarily. In Case (b) the disadvantage is that the time 
delay may change, depending on the lead on the computer or changes in the 
program. In both cases it can be necessary to include the computational delay 
in the design of the controller. 
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tei tr t} Time tra ty fz11 Time 


Figure 11.1 Two ways to synchronize inputs and outputs. (a) The signals 
measured at time f, are used to compute the control signal applied at fy- 
(b) The control signal is applied as soon as it is computed. 


In Case (b) it is desirable to make the delay as small as possible. This can 
be done by performing as few operations as possible between the A-D and D-A 
conversions. Assume that the regulator has the form 


u(t) + ryu(f—1)+...+7r,u(t—&) . 
= fouel) +... + tmuelt— m) —soy(t) —... — sıy(t — È) 
This equation can be written as 


u(t} = totte(t) — soy(t) + w(t — 1) 
where 


w(t — 1) = huelt — 1) +... + tyue(t — m) — s1y(f — 1} — ...— sy(t — L) 
—ryu(t—1)—...—r,u(t —k) 
Notice that the signal u’(t — 1) contains information that is available at time 


é— 1. An implementation of the control algorithm that exploits this to make 
the computational delay as small as possible is the following: 


1. Perform A-D conversion of y(t) and u-(t). 
2. Compute u(t) = éou-(t) — soy(é) +u (t — 1). 
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3. Perform D-A conversion of u(t). 
4, Compute 


w (E) = tytie(t) +... + tmue{t —m +1) — siy(t) — ... — sıy(t— l + 1) 
—ryu(t} —...—r,zu(t~k+1) 


The computational delay can thus be significantly reduced by a proper 
implementation of the controller. In a proper implementation the delay can be 
reduced. Apart from the two multiplications and the two additions, u(t} must 
be tested for limitations and anti-reset windup must be done in a proper way. 
Since the computational delay appears in the same way as a time delay in the 
process dynamics, it is important to take it into account in designing a control 
system. A common rule of thumb is that the time delay can be neglected if 
it is less than 10% of the sampling period. For high-performance systems it 
should always be taken into account. Since the time delay is not known until 
the algorithm has been coded, the control design may have to be repeated. For 
an adaptive system it is important that the model structure is chosen so that 
the computational delay can be accommodated. In multitasking systems it may 
also happen that the computational delay varies with time. 


Sampling and Pre- and Postfiltering 


The choice of sampling rate is an important issue in digital control. The sam- 
pling rate influences many properties of a system such as following of command 
signals, rejection of load disturbances and measurement noise, and sensitivity 
to unmodeled dynamics. Selection of sampling rates is thus an essential design 
issue. 

One rule of thumb that is useful for deterministic design methods is to let 
the sampling interval h be chosen such that 


Woh % 0.2 — 0.6 


where @, is the natural frequency of the dominating poles of the closed-loop 
system. This corresponds to 12-60 samples per undamped natural period. The 
sampling frequency is @, = 27/h. 

In all digital systems it is important that signals are filtered before they 
are sampled. All components of the signal with frequencies above the Nyquist 
frequency, @y = @,/2 = t/h should be eliminated. If this is not done, a signal 
component with frequencies @ > @y will appear as low-frequency components 
with the frequency 


Oa = |((@ + wy} mod ws) —a@y| 


This phenomenon is called aliasing, and the prefilters introduced before a 
sampler are called anti-aliasing filters. Suitable choices of anti-aliasing filters 
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Table 111 Damping and natural frequency of second-, fourth-, and sixth- 
order Butterworth, ITAE, and Bessel filters. The filters have the bandwidth 


Oy. 
Order Butterworth ITAE Bessel 
o/s č OEF č ojog 4 
2 1 0.71 1 0.71 1.27 0.87 
4 1 0.38 1.48 0.32 1.59 0.62 
1 0.92 0.83 0.83 1.42 0.96 
6 1 0.26 1.30 0.32 5.14 0.49 
1 0.71 0.98 0.60 4.57 0.82 
1 0.97 0.79 0.93 4.34 0.98 


are second- or fourth-order Butterworth, ITAE (integral time absolute error), 
or Bessel filters. They consist of one or several cascaded filters of the form 


u? 


s? + 20 as + w? 


Let Wg be the desired bandwidth of the filter. The damping ¢ and the frequency 
co for filters of different orders are given in Table 11.1. The Bessel filter has 
the interesting property that its phase curve is approximately linear, which 
implies that the waveform is also approximately invariant. 

The prefilter mtroduces additional dynamics into the system that have to 
be taken into account in the control design. The Bessel filter can be approxi- 
mated with a time delay. Assume that the bandwidth of the filter is chosen to 
be 


G,(s) = 


(Gaa (ion) = F l 
where G,,(s) is the transfer function of the filter and ay = 2/h is the Nyquist 


frequency. Parameter £ is the attenuation of the filter at the Nyquist frequency. 
Table 11.2 gives the approximate time delay Ty as a function of f. The table 


Table 11.2 The approximate time delay Ty due to the anti-aliasing filter 
as a function of the desired attenuation f at the Nyquist frequency for a 
fourth-order Bessel filter. A is the sampling period. 


B Oy [Oz Tafh 


0.05 3.1 2.1 
0.1 2.5 1.7 
0.2 2.0 1.3 
0.5 L4 0.9 


0.7 1.0 0.7 
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also gives @y as a function of filter bandwidth wg. The relative delay increases 
with attenuation. For reasonable values of the attenuation the delay is more 
than one sampling period. This means that the dynamics of the filter must be 
taken into account in the control design. We illustrate this by an example. 


EXAMPLE 11.1 The effect of the anti-aliasing filter 
Consider a process described by 


1 

PES s{s + 1) 

A pole placement controller is designed to give a closed-loop system whose 
dominant poles are given by ©m = 1 rad/s and m = 0.7. The digital controller 
has a sampling period of h = 0.5. To illustrate the effect of aliasing, we assume 
that the output of the system is disturbed by a sinusoidal signal; that is, the 
measured signal is 

¥m(t} = y(£) + aa sin(aat) 

with ag = 0.1. This signal is filtered through a fourth-order Bessel filter with 
bandwidth @,. Figure 11.2 shows a number of simulations of the system that 
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Figure 11.2 Output, reference value, and control signal for the system in 
Example 11.1. Fhe measurement disturbance has the frequency wz = 11.3 
rad/s. (a) @g = 25 rad/s; (b) @s = 6.28 rad/s; (c) @s = 6.28 rad/s and 
the regulator compensated for a delay of 0.7h; (d) @g = 2.51 rad/s and the 
regulator compensated for a delay of 1.7h. 
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illustrate the effects of aliasing and filtering. Figure 11.2(a) shows setpoint, 
process output, and control signal. Since the bandwidth of the filter is @p = 25, 
the measured signal is not attenuated much by the filter. The disturbance 
with frequency @g = 11.3 is aliased to 1.3 rad/s because of the sampling 
with Nyquist frequency @y = 27. The alased disturbance is clearly visible 
in the process input and output. In Fig. 11.2(b) the bandwidth of the prefilter 
is reduced to @g = 6.28 rad/s. This bandwidth is not sufficiently small to 
give a substantial reduction of the disturbance. Notice also that the overshoot 
has increased a little because of the delay in the prefilter. In Fig. 11.2(c) the 
dynamics of the prefilter have been taken into account by adding a time delay 
of 0.7h in the model. The overshoot is then reduced, but the effect of the 
disturbance is similar to Fig. 11.2(b). In Fig. 11.2(d) the filter bandwidth has 
been reduced to wg = 2.51. The disturbance is now reduced significantly. We 
have also taken the dynamics of the filter into account as a time delay of 1.7h 
in designing the controller. The aliased disturbance is now barely noticeable in 
the figure. o 


Example 11.1 shows that it is important to use an anti-aliasing filter and 
that the filter has to be considered in the design. For a Bessel filter, however, it 
is sufficient to approximate the filter by a time delay. The additional dynamics 
cause no principle problems for an adaptive controller because all parameters 
are estimated. However, inclusion of the prefilter dynamics will increase the 
model order significantly. In the particular case of Example 11.1 the model 
will increase from second order to sixth order. This means that the number of 
parameters that we have to estimate increases from 4 to 12. A simple way to 
reduce the number of parameters is to approximate the prefilter by a delay. It 
is then sufficient to estimate five parameters of the model 


y(t) + ary(t — R) + azy(t — 2h) = 
bou(t — dh) + biul(t — dh —h) + boult — dh — 2h) 


where the value of d depends on the bandwidth of the filter (see Table 11.2). 

It is cumbersome and costly to change the bandwidth of an analog prefilter. 
This poses problems for systems in which the sampling rate has to be changed. 
A nice implementation in such a case is to use dual rate sampling. A high fixed 
sampling rate is used together with a fixed analog prefilter, A digital filter is 
then used to filter the signal at a slower rate when that is needed. This implies 
that fewer parameters have to be estimated. 

The output of a D-A converter is a piecewise constant signal. This means 
that the control signal fed to the actuator is a piecewise constant signal that 
changes stepwise at the sampling instants. This is adequate for many pro- 
cesses, However, for some systems, such as hydraulic servos for flight control 
and other systems with poorly damped oscillatory modes, the steps may excite 
these modes. In such a case it is advantageous to use a filter that smooths 
the signal from the D-A converter. Such a filter is called a postsampling filter. 
The postsampling filter may be a simple continuous-time filter with a response 
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time that is short in comparison with the sampling time. Special D-A convert- 
ers that give a smooth signal have also been constructed. Another solution to 
the problem is to use a system with dual rate sampling. The primary control 
system should then be designed so that the output is piecewise linear between 
the sampling instants. A fast sampling can then be used to generate an ap- 
proximation to this signal, possibly followed by an analog postsampling filter. 


Controller Windup 


Linear theory is adequate to deal with many problems in control system design. 
There is, however, one nonlinear problem that we must deal with in almost all 
practical control systems, and that is actuator saturation. The feedback will 
be broken when the actuator saturates. Large deviation may then occur if the 
process or the controller is unstable. A simple case in which this occurs is when 
the controller has integral action, The phenomenon was first observed in con- 
nection with PID control. It is therefore often called integrator windup because 
the integral term “winds up” when the actuator is saturated. Since integral 
action was also called reset, the phenomenon is also called reset windup. It is 
necessary to include a scheme for avoiding windup in systems in which the 
process and/or the controller is unstable. 

There are many different ways to introduce anti-reset windup. One simple 
way is based on the interpretation of a controller as a combination of a state 
estimator and state feedback. Such a system is shown in Fig. 11.3. The con- 
troller is composed of two components, a state estimator and a state feedback. 
The state estimator determines an estimate of the state based on the process 
input and output. The state feedback generates the control signal based on the 
estimated state. It is intuitively clear from the figure that the state estimator 
will perform poorly when the actuator saturates because it is uses a wrong 


(a) 


Figure 11.3 Block diagrams of controllers based on state feedback and state 
estimation with a process having a nonlinear actuator. 
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value of the control signal applied to the process. This interpretation also sug- 
gest that the problem can be avoided by feeding back the actual process input, 
Ua, or an estimate of it as in Fig. 11.8(b). 

Since we are using polynomial representations of the controller in this 
book, we also give a polynomial interpretation of the scheme. Consider the 
controller 

R(g)u(t) = T(q)ue(t) — S(a) y(t) 


where the polynomial R(g) is assumed to be monic. The controller can be 
written in observer form as 


A,(q)u(t) = T(q)uc(t) — S(q@)y(t) + (Alq) — R(q))u(2) 


where A,(q) is the observer polynomial. Let the saturating actuator be de- 
scribed by the nonlinear function f(u). A controller that avoids windup is then 
given by 
Ao(q)u(t) = T(q)uc(t) — S(a)y(#) + (Ao(a) — R(q)) u(t) (11.1) 
u(t) = f(v(8)) 


A similar scheme can be used when the saturation is dynamic. Notice that the 
controller responds with the observer dynamics when the feedback is broken. 
A particularly simple case is when At = 1, which corresponds to a deadbeat 
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Figure 11.4 Simulation of adaptive control of the unstable process with a 
saturating actuator in Example 11.2. 
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observer. The controller is then 


u(t) = f(T*(a")uc(t) — S (ayt) + A -— R* (qult) 
We illustrate windup by an example. 


EXAMPLE 11.2 Windup and how to avoid it 


Consider the simple example of adaptive contro] in Example 3.5. The process 
has the transfer function 

1 
s(s +1) 


Assume that there is an actuator that saturates when the magnitude of the 
control signal is 0.5. Figure 11.4 shows the behavior of the system if no pre- 
cautions are taken in the controller, The figure clearly shows the detrimental 
effects of actuator saturation. The process runs open loop when the actuator 
saturates, and the output is drifting because the process has an integrator. 
This will also happen with a controller with fixed parameters. With an adap- 
tive controller the saturation also causes the gain parameters (bp and 6;) to 
be underestimated. The controller gain is then too high, and the system be- 


G(s) = 
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Figure 11.5 Simulation of an adaptive controller for a process with a sat- 
urating actuator with a controller having windup protection. Compare with 
Fig. 11.4, which shows the same simulation for a system without windup 
protection. 
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comes unstable. Windup is thus much more serious in adaptive control than 
in a controller with constant gain. 

Figure 11.5 shows a simulation corresponding to Fig. 11.4 when the mod- 
ification (11.1) is introduced to avoid windup. In this case there are clearly 
no difficulties. The control signa] remains within the bounds —0.5 < u < 0.5 
all the time. If the control signal saturates over a longer period of time, the 
adaptation should be switched off. l D 


Windup will always cause difficulties. In cases like Example 11.2 the phe- 
nomenon is serious because the process is unstable. If the controller is unstable, 
the control law given by Eqs. (11.1) will automatically reset the controller state 
with a speed corresponding to the observer dynamics. 


11.3 CONTROLLER DESIGN 


Automatic control design is another important part of an adaptive controller. 
Fairly simplistic design methods were used in developing the adaptive con- 
trollers in Chapters 3, 4, and 5. Simpie methods were used to keep the overall 
complexity of the system reasonable. To achieve this, it was sometimes neces- 
sary to assume that the processes were minimum-phase systems. This was the 
case, for example, for model-reference adaptive systems. Since control design is 
done automatically in closed loop, it is also necessary to introduce safeguards 
to make sure that all conditions required for the design method are fulfilled. 
For instance, it may be necessary to test whether the estimated process model 
is minimum-phase or whether there are common factors in the estimated poly- 
nomials. Direct adaptive controllers have the advantage that the design step 
is eliminated, since the parameters of the regulator are estimated directly. No- 
tice, however, that several assumptions are made implicitly in using the direct 
algorithms. Direct methods are also restricted to special classes of systems. 


Design Procedures 


Many different design procedures can he used for adaptive control. Feedback 
by itself can make a closed-loop system insensitive to variations in process 
dynamics. There are also special so-called robust design methods that take 
process uncertainty into account explicitly. In deriving an adaptive controller 
it seems appealing to base it on a robust design method. It is also of interest to 
try to combine robust and adaptive control The estimator should then provide 
estimates of the model and its uncertainty. The design method should take the 
uncertainty into account. Unfortunately, control and estimation theory has not 
yet progressed to the state in which such estimation and control procedures 
are available. Many of the robust design methods do also require manual 
interaction. Such procedures cannot be used in an adaptive controller. The pole 
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placement design procedure is quite useful in practice, in spite of its simplicity. 
However, it can be improved significantly by some simple modifications that 
give more robust closed-loop systems. Some ways to do this are discussed in 
this section. 


Specifications 


To obtain a robust controller, it is very important that specifications be chosen 
in a sensible way. With a pole placement design, this means that the desired 
closed-loop poles have to be chosen with care. Poles that are too fast will give 
controllers that are very sensitive. This can be understood from the following 
expression, which gives a sufficient condition for stability of a pole placement 
design: 

H(2)T (2) 
Hm(2)}S(2) 
In this expression, H is the pulse transfer function of model used to design the 
controller, Ho is the pulse transfer function of the true plant, Hm = Bm/Am 
is the desired response, and S and T are the controller polynomials. The 
inequality should hold on the unit circle. The condition implies that high model 
precision is required for those frequencies at which the desired closed-loop 
system has significantly higher gain than the model. 

A reasonable way to determine the closed-loop poles is to make the obser- 
vation that it is difficult to obtain a crossover frequency that is significantly 
higher than the frequency at which the plant has a phase lag of 180°—270°. No- 
tice that these frequencies can conveniently be determined by a relay feedback 
experiment. 


|H (2) — Ho(2)| < 


Youla Parameterization 


The Diophantine equation is a key element of pole placement design. This 
equation has many solutions. If the polynomials R° and S° are solutions of the 
equation 

AR?’ + BS? = A? 
it follows that the polynomials R and S given by 

R=XR°+YB 
(11.2) 
S=XS°-YA 


satisfy the equation 
AR+BS=XA° 


If a controller characterized by the polynomials R’ and S° gives a closed-loop 
system with the characteristic polynomial A®, then the controller 


(XR° 4+ YB)u = —(XS°— YA)y (11.8) 
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Figure 11.6 Block diagram of the closed-loop system with the controller 
(11.4). 


gives a closed-loop system with the characteristic polynomial X A°. The system 
is stable if the polynomial X is a stable polynomial but the polynomial Y can be 
chosen arbitrarily. It thus follows that if the controller R°u = —S°y stabilizes 
the system Ay = Bu, then all controllers that stabilize the system are given by 
Eq. (11.3). The equation is called the Youla parameterization of all controllers 
that stabilize the system. Equation (11.3) can also be written as 


S° Y 
u = TROY + yp Bu) (11.4) 
This control law is illustrated by the block diagram in Fig. 11.6. 


Robust Fole Placement 


The Youla parameterization can be used to impose extra conditions on the 
controller. This idea was used in Section 3.6 to obtain controllers that have 
integral action. We now use it to improve the robustness of the controller. 
One way to do this is to require that the controller have small gain for those 
frequencies at which the process is very uncertain. For example, it is useful 
to require that the controller should have zero gain at the Nyquist frequency. 
This is accomplished by the condition S(—1) = 0. We can also require that the 
controller gain be zero at frequency Wo. This is equivalent to requiring that 
the polynomial 
g — 2cos(@oh)g +1 

be a factor of S(q). To satisfy such requirements, the order of the closed-loop 


system must thus be increased. The additional poles introduced are specified 
by the polynomial X. We illustrate this by an example. 
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EXAMPLE 11.3 Robust pole placement 


Assume that we have obtained a controller R°, S° that gives a closed-loop 
system with the characteristic polynomial A? and that we want to improve its 
robustness by requiring that S(—1) = 0. To do this, we introduce one more 
closed-loop pole. We choose 

X(q) =q- x 


with |xo| < 1. The polynomial Y can also be of first order. Equation (11.2) gives 
S(q) = (g — xo)S° (a) — (a — 90) A(Q) 
Requiring that S(—1) = 0 gives 


(1 + x)S°(—1) 


The robust controller is then characterized by 
R(q) = (q — xo)R° (g) + (a — 90) B(@) 
S(q) = (g — x0)8°(g) — (q — yo) A (a) 


Notice that it is possible to proceed recursively to make the controller more 
and more complex. oO 


Decoupled Command Signal and Disturbance Responses 
Consider the process 
Ay = Bu +v 
and the controller 
Ru = Tu, — Sy 
The closed-loop system is characterized by 
ya Bt 
A. 
AT 


Ue + pi U 

A 

ats. (11.5) 
A, "A, 

where A, = AR + BS is the closed-loop characteristic polynomial. Assume 

that no process zeros are canceled, and factor the characteristic polynomial as 

A, = AjAm. If we choose T = T’A,, Eqs. (11.5) become 


BT’ R 
YS Ag Babes 
wu 47 mn- Sov 

An i Aom 
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The command signal response is governed by the dynamics of Am, but the 
disturbance response is governed by the dynamics of A,A,,. In this sense it 
is thus coupling between the command signal response and the disturbance 
response. In some cases it is desirable that the dynamics of the command 
signal responses and the disturbance responses are completely decoupled. This 
can be achieved by requiring that T = T'A, and R = R'A,,. The closed-loop 
system is then characterized by 


¥ Br a 4%, 
Am “Ao 
un AT,» 

An © AoAm 


The Diophantine equation then becomes 
AA, R' + BS = AjAm 


To have a causal controller, we must require that deg Am > deg A. The 
minimum-degree causal solution to this equation is such that deg S = deg A+ 
deg Am — 1. Furthermore, deg R = deg Am + deg A, — deg A. To have a causal 
controller, we must thus require that deg S < deg R. This implies that 


deg A + deg Am — 1 < deg Am + deg A, — deg A 


Hence deg A, > 2deg A-— 1. We thus find that the minimum-degree solution 
that decouples the response to command signals and disturbances is such that 


deg A, =n deg A, = 2n—1 deg k = deg S = 2n — 1 


where n = deg A. 
A design of this type can be very useful when there are very noisy mea- 
surements and a fast setpoint response is desired. 


11.4 SOLVING THE DIOPHANTINE EQUATION 


Several of the design methods discussed earlier involve the solution of a Dio- 
phantine equation 
AR+BS=A, (11.6) 


Efficient methods for solving this equation are needed. The equation is linear 
in the polynomials R and S. A solution always exists if A and B are relatively 
prime. However, the equation has many solutions. This is easily seen: If R° 
and S° are solutions, then 

R=R°+BQ 


S=S8°—AQ 
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are also solutions, where Q is an arbitrary polynomial. A particular solution 
can be specified in several different ways. Since a controller must be causal, 
the condition deg S < deg Æ must hold. This condition will restrict the number 
of solutions significantly. An efficient way to solve the equation is to use a 
classical algorithm of Euclid. 


Euclid’s Algorithm 


This algorithm finds the greatest common divisor G of two polynomials A and 
B. If one of the polynomials, say B, is zero, then G is equal to A. If this is not 
the case, the algorithm is as follows. Put Ap = A and Bo = B and iterate the 
equations 

An+1 = By 

Basi = Ay mod Bn 


until B„+ı = 0. The greatest common divisor is then G = B,. When A and B 
are polynomials, A mod B means the remainder when A is divided by B. This 
is in full agreement with the case when A and B are numbers. Backtracking, 
we find that G can be expressed as 


(11.7) 


AX+BY=G (11.8) 


where the polynomials X and Y can be found by keeping track of A, div B, in 
Euchd’s algorithm. This establishes the link between Euclid’s algorithm and 
the Diophantine equation. The extended Euclidean algorithm gives a conve- 
nient way to determine X and Y as well as the minimum-degree solutions U 
and V to 


AU +BV =0 (11.9) 
Equations (11.8) and (11.9) can be written as 
r(3)= (v v) le}- (0) ca 


The matrix F can thus be viewed as the matrix, which performs row operations 
on (A B)’ to give (G 0)” A convenient way to find F is to observe that 


xX Y Al1l0) (GX z) 
U vy) (Boi1y louyv 
The extended Euclidean algorithm can be expressed as follows: Start with the 
matrix 
A 10 
M = 
(5 o1) 


If we assume that deg A > deg B, then calculate Q = A div B, multiply the 
second row of M by Q, and subtract from the first row. Then apply the same 
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procedure to the second row and repeat until the following matrix is obtained: 
F X S 
0 U Y 
A nice feature of this algorithm is that possible common factors in A and 


B are determined automatically. The essential difficulty in implementing the 
algorithm is to find a good way to test for a polynomial being zero. 


Solving the Diophantine Equation 


By using the extended Euclidean algorithm it is now straightforward to solve 
the Diophantine equation 
AR + BS = A. (11.11) 


This is done as follows: Determine the greatest common divisor G and the as- 
sociated polynomials X, Y, U, and V using the extended Euclidean algorithm. 
To have a solution to Eq. (11.11), G must divide A,. A particular solution is 
given by 


R’ = XA, divG 
Ac (11.12) 
S? = YA, divG 
and the general solution is 
R=R°+QU 
g (11.13) 
S=S8°+QV 


where Q is an arbitrary polynomial, The minimum-degree solution is obtained 
by choosing Q = —S° div V. This implies that S = S° mod V. 


Relations to Ordinary Linear Equations 


By equating coefficients of equal order, the Diophantine equation given by 
Eq. (11.11) can be written as a set of linear equations: 


1 oOo... O & oO... 0 
ay 1 Er : by bo ES : ry Qo1 — Gi 
aa a > 0 bb œœ O : : 

: : . I : : - bo rk _ Gen — On (11.14) 
Qn: ai b, : by So Qen+l 

0 an >: O bn 

. . . - . S? Qek+Hl+l 

0 O a 0 O bn 


k columns i+ 1 columns 
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The matrix on the left-hand side is called the Sylvester matrix; it occurs fre- 
quently in applied mathematics. It has the property that it is nonsingular if 
and only if the polynomials A and B do not have any common factors. If there 
are no common factors, a unique solution to Eq. (11.14) exists. Notice, however, 
the nonuniqueness with respect to the orders of R and S. Different choices 
of k and 7 will give different R and S, as discussed above. The solution to 
Eq. (11.14) can be obtained by Gaussian elimination. This method does not use 
the special structure of the Sylvester matrix. 


11.5 ESTIMATOR IMPLEMENTATION 


There are many issues that have to be considered in the implementation of an 
estimator. This section can be summarized by the following motto: 


“Use only good relevant data, treat it carefully, and don’t throw away 
useful information.” 


The key issue is that we want to obtain a model that is relevant for the 
task of control system design and that we want to track changes in the model. 
The tasks are influenced by many factors. In this section we discuss selec- 
tion of modei structure, filtering and excitation, parameter tracking, estimator 
windup, and robustness modifications. 


Model Structure 


The real physical processes that we try to control may have complicated dynam- 
ics, They may be nonlinear or infinite dimensional. One reason for the success 
of automatic control is that good control can often be based on relatively simple 
dynamical models. Such models can work very well under specific operating 
conditions, but the parameters of the model will depend on the operating con- 
ditions. In adaptive control it is attempted to fit a simple linear model on line 
and to adjust the parameters. For this purpose it is of paramount importance 
to understand what happens in fitting complicated dynamics with simple mod- 
els. One fundamental fact is that the result obtained is crucially dependent on 
the nature of the input signal. This is illustrated by the following example. 


EXAMPLE 11.4 Fitting low-order models to high-order systems 
Consider a process with transfer function G(s). Assume that one attempts to 
model the system by a first-order system with transfer function 

b 
sta 
If the input signal is sinusoidal with frequency wp, it is possible to get a perfect 
fit with finite values of the parameters if Im{G(ia@,)} # 0. Straightforward 


G(s) = 
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calculations show that G(iw,) = G(iq,) if the parameters are chosen to be 


_ @ Re{G(ia,)} 
*— ~Tm{G(ia,)} 
p — —BalG(i@)|? 

Im{G(ia.)} 


The transfer function of the model will then fit the data perfectly, but the 
parameters obtained depend on @,. The parameter values may change signif- 
icantly with the frequency of the input signal. m 


An interesting property of adaptive systems is that the parameters are 
estimated in closed loop. This implies that the simple model in the adaptive 
controller is fitted with the actual signals generated by the feedback. It explains 
intuitively the self-tuning property. 

Another important observation is that the difficulty in on-line parameter 
estimation increases significantly with the number of parameters in the model. 
With many parameters the requirements on excitation also increase. For this 
purpose it is useful to try to reduce the number of unknown parameters as 
much as possible. This can be done by using a priori knowledge. This is often 
expressed in continuous-time models. The results are also strongly application 
dependent. Examples of this are given among the problems in the end of this 
chapter. Models consisting of low-order dynamics and time delays have proved 
very useful in process control. Such models can be represented by pulse transfer 
functions of the form 


_ boz +h, 
H(z) = Zeta) (11.15) 
or b2? b 
_ boz“ + biz + be 
H2(z) = AP + az bay) (11.16) 


where the time delay T is between dh and dh + h. Equation (11.16) can also 
represent second-order oscillatory systems. More b parameters can be included 
if the time delay is uncertain. In many cases it is known a priori that there 
are integrators in the model. This leads to transfer functions that contain the 
factor z — 1 in the denominator. 


Data Filters and Excitation 
Assume that the process is described by the discrete-time model 


y(t) = Go(qg)u(t) + v(t) (11.17) 


Notice that possible anti-aliasing filters appear as part of the process Go(q). 
The disturbance v(t) can be the sum of deterministic, piecewise deterministic, 
and stochastic disturbances. The signal has low-frequency and high-frequency 
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Figure 11.7 Amplitude curve for the data filter H;(q). 


components. In stochastic control problems it is important to design a con- 
troller that is tuned to a particular disturbance spectrum. In that case it is, of 
course, important to estimate the disturbance characteristics. In a determinis- 
tic problem we are concerned primarily with the term Go(q)u(t) in the above 
equation, and we are not particularly interested in the detailed character of 
the disturbance v(t}. In the following discussion we consider this case. 

The presence of the disturbance v(t} will, of course, create difficulties in the 
parameter estimation. However, the effect of u(t) can be reduced by filtering. 
Assume that we introduce a data filter with the transfer function Hp and that 
we apply this filter to Eq. {11.17}. Then 


¥(t) = Golau r(t) + vz () (11.18) 


where 


yet} = Hrast)  ur(t)= Hrg) and _—su (t) = Helaj) 


By a proper choice of the data filter we may now make the relative influence of 
the disturbance term smaller in Eq. (11.18) than in Eq. (11.17). The filtering 
should also emphasize the frequency ranges that are of primary importance 
for control design. The disturbance v(t) typically has significant components 
with low frequencies. Low-frequency components should thus be reduced. Very 
high frequencies should similarly be attenuated. One reason for this is that if 
the model 


A(g)yr(t) = B(qju c(t) 


is fitted by least squares, it is desirable that A(q)vr(t} be white noise. Since 
filtering with A implies that high frequencies are amplified, it means that v¢(t) 
should not contain high frequencies. The data filter will therefore typically have 
band-pass character, as shown in Fig. 11.7. The center frequency is typically 
around the crossover frequency of the system. 

In Section 3.5. we suggested using a filter with the transfer function 


1 


H= TMA 
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This filter is a typical low-pass filter that does not attenuate low frequencies. 
In Section 11.7 we will present other ways to choose the data filter. A typical 
data filter is given by 

(1-a)(q-1) 


Hyla) = E 


Some ways to choose the data filter will be discussed later. 

It has been emphasized many times that it is necessary for the input signal 
to be persistently exciting of sufficiently high order to estimate parameters 
reliably. Taking into account that we are fitting low-order models to high-order 
systems, it is also necessary that persistency of excitation be achieved with 
signals in a frequency band where model accuracy is required. 


Parameter Tracking 


The key property of an adaptive controler is its ability to track variations 
in process dynamics. To do so, it is necessary to discount old data, a process 
that involves compromises. If parameters are constant, it is desirable to base 
the estimation on many measurements to reduce the effects of disturbances. 
If parameters are changing, however, it can be very misleading to use a long 
data record, since the parameters may not be the same. There are many ways 
to accommodate this problem. The best solutions are obtained if the nature 
of parameter variations is known. There are two prototype situations. One 
case is when parameters are slowly drifting; the other is when parameters are 
constant for long periods and jump from one value to another. Many attempts 
have been made to deal with the problem of parameter tracking, and there 
is a substantial literature. Most work is based on the assumption of detailed 
descriptions of the nature of parameter variations. A typical example is that. 
the parameters are Markov processes with known transition probabilities, Such 
detailed information about the parameter variations is rarely available, and we 
therefore give some heuristic ways to deal with parameter tracking. 


Exponential Forgetting 


Exponential forgetting is a way to discard old data. It is based on the as- 
sumption that the least-squares loss function is replaced by a loss function in 
which old data is discounted exponentially. It follows from Theorem 2.4 that 
the recursive least-squares estimate with exponential forgetting is given by 


6(t) = 6(t—1) + K (£) (o(t) — 97 (He — 1) 
K(t) = P(t- p(t) (4+ 97 OPE- De)” (11.19) 


P() = -KO H) PE- 
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Table 11.8 Relations between the ratio T;/h and the coefficient 4. 


Teh A 
1 0.87 
2 0.61 
5 0.82 
10 0.90 
20 0.95 
50 0.98 
100 0.99 


where the sampling period A was chosen as the time unit. The forgetting factor 
is given by 

As —~h{T; 
where 7; is the time constant for the exponential forgetting. To make an 
assessment of reasonable values of the forgetting factor, we give the values 
of the forgetting factor for different ratios T;/h in Table 11.3. 

It is possible to generalize the method with exponential forgetting and have 
different forgetting factors for different parameters. However, this requires 
information about the nature of the changes in different parameters. Another 
modification is to modify Eqs. (11.19) so that only the diagonal elements are 
divided by A. 

Tracking of a time-varying parameter is illustrated by an example. 


EXAMPLE 11.5 ‘Tracking parameters of a time-varying system 
Consider a process described by the differential equation 
dy 
— = —yļt K 
Y = y(t) + Kp(t)u 
where the process gain is time varying. The process is controlled by an indirect 
adaptive controller that estimates parameters of the discrete-time model 
y(Rh +h) + ay(kh) = bu(kh) 
and designs a controller with integral action using robust pole placement with 


An(g) = q + am = q — eli 


and 
l A o(a) = q + ao = q — e*l T 
Straightforward computations give a controller of the form 


u(kh) = toue(kh) + tuclkh — h) — soy(kh) — siy(kh —h) +u(kh — h) 


470 Chapter 11 Practical Issues and Implementation 


Ue 


¥ 
I j r ro P j 
-1 ' fod = LS ha H ii r f. iii 
400 600 800 


0 200 


0 200 400 600 800 1000 
Time 


Figure 11.8 Tracking time-varying parameters. 


where 

i _ Itam 
nar 
fi = Qoto 

1+ +am—a 
s= —— yp 
s _ Om +a 
m8 


Since only the process gain is unknown, we will estimate only the parameter b. 
In the simulation we will also assume that the gain varies sinusoidally between 
the values 0.1 and 1,9 with the period 400. In Fig. 11.8 we show a simulation of 
the system when the command signal is a square wave with period 50, there is 
measurement noise with the standard deviation 0.02, and the forgetting factor 
is A = 0.95. Notice that the gain variation is clearly noticeable in the shape of 
the control signal, which changes significantly over one step. Figure 11.8 shows 
that the estimated gain lags the true gain. The forgetting factor is A = 0.95, 
and the sampling period is h = 0.5. The time constant associated with the 
exponential forgetting is then Ty = 10 s, which is a crude estimate of the 
time lag in the estimator. Notice also that the lag is different for increasing 
and decreasing gains, a feature that indicates the nonlinear nature of the 
problem. The forgetting factor can be decreased to reduce the tracking lag. 
The estimates will then have more variation. To illustrate this, we simulate 
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Figure 11.9 Parameter tracking error b = 6 — b for different forgetting 
factors: (a) 4 = 0.1; (b) A = 0.7; (e) A = 0.95. 


the same system as in Fig. 11.8 with different forgetting factors. The results 
are shown in Fig. 11.9. The figure shows that the forgetting factor A = 0.95 
is too large because the systematic tracking error is too large. The forgetting 
factor A = 0.1, on the other hand, is too small, and the systematic error is 
small, but the random component is large. In this particular case the value 
A = 0.7 is a reasonable compromise. The reason for the low value of A is that 
the parameter variations are quite rapid. oO 


Covariance Resetting 


In some situations the parameters are constant over long periods of time and 
change abruptly occasionally. Exponential forgetting, which is based on the 
assumption of a behavior that is homogeneous in time, is less suitable in 
this case. In such a situation it is more appropriate to reset the covariance 
matrix of the estimator to a large matrix when the changes occur. This is 
called covariance resetting. We illustrate this method by an example. 


EXAMPLE 116 Covariance resetting 


Consider the same system as in Example 11.5, but assume now that the 
parameter is piecewise constant. In Fig. 11.10 we show the results obtained 
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Figure 11.10 Tracking piecewise constant parameters using exponential 
forgetting when 4 = 0.95. 


with exponential forgetting with 4 = 0.95. The figure shows clearly that the 
estimate of the process gain responds quite slowly when the gain changes. 
Notice also the strong asymmetry in the response of the estimate when the 
gain changes. It takes much longer for the estimate of the gain to increase 
than to decrease. The reason for this is the large difference in excitation. Also 
notice the stepwise nature of the estimates. Good excitation is obtained only 
when the command signal changes. In Fig. 11.11 we show the same system as 
in Fig. 11.10 with A = 1 and covariance resetting. The covariance matrix is 
reset by reducing A to 0.0001 when the parameter changes. Notice the drastic 
difference in the tracking rate. o 


The example clearly illustrates the advantage of using covariance resetting 
when the parameters change abruptly. To use this effectively, it is necessary 
to detect the changes in the parameters. There are many ways to do this 
by analyzing residuals or parameter changes. It is also possible to reset the 
covariance periodically. 


Parallel Estimators and Other Schemes 


There are many other ways to deal with parameter tracking. One possibility 
is to have several parallel estimators with different forgetting factors and to 
choose the one in which the estimates have the smallest residuals. It is also 
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Figure 11.11 Tracking piecewise constant parameters using covariance re- 


setting. 


possible to have- several parallel estimators that are reset periodically in a 
staggered way. There are also other schemes in which the forgetting factor is 
made signal dependent. 


Estimator Windup 


Exponential forgetting works well only if the process is properly excited all the 
time. There are problems with exponential forgetting when the excitation is 
poor. To understand this, we first consider the extreme case in which there is 
no excitation at all, that is, @ = 0. The equations for the estimate then become 


olt +1) = 6(8) 
P(t+1)= PO) 


The equation for the estimate @ is thus unstable with all eigenvalues equal to 1, 
and the equation for the P-matrix is unstable with all eigenvalues equal to 1/A. 
In this case the estimate will thus remain constant, and the P-matrix will grow 
exponentially if A < 1. Since the estimator gain is Pø, the gain of the estimator 
will also grow exponentially. This means that the estimates may change very 
drastically whenever p becomes different from zero. The phenomenon is called 
estimator windup in analogy with integrator windup. 
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A similar situation occurs if the regression vector is different from zero but 
restricted to a subspace. We illustrate this by an example. 


EXAMPLE 11.7 A constant regression vector causes windup 
Consider a process with the transfer function 


__B 
sd sta 
with an indirect adaptive controller based on estimation of parameters a and 
6 in the discrete-time model 


y(kh +h) + ay(kh) = bu(kh) 


The control design is the same as in Example 11.5. The controller has integral 
action. The parameters have the values æ = 1 and f = 1, the sampling period 
is h = 0.5 s, there is measurement noise with a standard deviation of 0.05, 
the setpoint is piecewise constant, and the forgetting factor is ÀA = 0.95. To 
illustrate the effect of poor excitation, the setpoint will be kept constant for 
long periods of time. 

The parameters are in R*. To have excitation, the regression vectors should 
also span R? persistently. When the setpoint is constant, the input and the 
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Figure 11.12 Illustration of estimator windup due to poor excitation. (a) 
Output y and setpoint u,; (b) covariance pu; (c) control signal u; and (d) 
estimator gain ķi. 
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Figure 11.13 Parameter estimates in the case of estimator windup due to 
poor excitation. The dashed lines show the correct values of the parameters 
and of the gain of the process, 


output settle to constant values after a transient. The regression vector then 
becomes 


p(t) = (—uc auc/b) 


This vector lies in a one-dimensional subspace of R? and is thus not persistently 
exciting. The simulation shown in Fig. 11.12 illustrates the behavior of the 
system. The process output tracks the command signal quite well, and the 
control signal is also quite reasonable. Figure 11.12 shows that the element pır 
of the P-matrix grows approximately exponentially during the periods when 
the command signal is constant. The deviations are due to the measurement 
noise that pives some excitation. The other elements of the P-matrix behave 
similarly. The estimator gains also grow significantly. Exponential growth of 
the P-matrix and the associated increase in the estimator gains are clearly 
noticeable in Fig. 11.12. The parameter estimates will change significantly, as 
shown in Fig. 11.18. The estimates are very inaccurate at the end of the periods 
when the command signal has been constant. In Fig. 11.13 we also show the 
estimate of process gain calculated from 


b 
I+ê 
This estimate is very good for the whole period because this variable is well 


k= 
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excited. This is why the controller behaves reasonably well in spite of the poor 
estimates of a and b. = 


To get further insight into the windup phenomenon, we make a simplified 
analysis of the behavior shown in Example 11.7. For this purpose we assume 
that the regression vector is constant, that is, p(t) = po. The inverse of the 
P-matrix is given by 


i 
_ 1-a' 
P(t+1) = APS +Y A popo = APS + 7 Popo 


k=l 


Using the matrix inversion lemma (Lemma 2.1), we find after some calcula- 
tions that the covariance matrix can be written as 


P T 
P(t +1) = 5 (Po- gate — | 


Ata (t) + 93 Popo 
where 
a(t) 1-aA 
~ [At 
Furthermore, we find that 
Popo 


The P-matrix can be decomposed as 


P(t + 1) = P(t) + Blt) 095 
where -4 P Tp 
Pe _ _ PopoG Po ) _ T 
and Tp 
Po topo 
£) = a(t E CYS YP F OE N 
pt) OG Po)? (åta (t) + på Povo) 
The matrix P(¢) is of rank n — 1 with P(t)gp = 0. Since |A] < 1 we have as 


i oo 
alt) -+ 1— AÀ 


1-A 
b(t) -> —-—— 
OO” oop 
In the decomposition of P(t+1) we thus find that the matrix Ë goes to infinity 
as A and that f(t)popf goes to a constant (1 — Apop? /(92 p}. 

Intuitively, the result of the calculation can be interpreted as follows: When 
the regression vector is constant, we obtain information only about the compo- 
nent of the parameter that is parallel to the regression vector. This component 
can be estimated reliably with exponential forgetting. The “projection” of the 
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P-matrix in this direction converges to 1-—A, and the “orthogonal” part of the 
P-matrix goes to infinity as At. Estimator windup is thus obtained by ex- 
ponential forgetting combined with poor excitation. There are several ways to 
avoid estimator windup. We now discuss some of these techniques. 


Conditional Updating 


One possibility to avoid windup in the estimator is ta update the estimate and 
the covariance only when there is excitation. The algorithms obtained are called 
algorithms with conditional updating or dead zones. A correct detection of 
excitation should be based on calculation of covariances or spectra as discussed 
in Section 2.4. Simpler conditions are often used in practice. Common tests are 
based on the magnitudes of the variations in process inputs and outputs or 
other signals such as £ and p7 Pg. Notice that the quantity @” P@ is dimension 
free. 
If the regression vector is constant, it follows from Eq. (11.20) that 


Tp g 
T Plg = a(t) — eo _ 
Po POP = aO Teale) + 98 Popo 
As t > œ, it follows that a{t) + 1— A. If gp? Pp is used as a test quantity, it 
is thus natural to normalize it by 1 ~ A. The effect of conditional updating is 
illustrated by an example. 


EXAMPLE 11.8 Conditional updating 
Consider the system in Example 11.7, but modify the estimator to provide 
conditional updating. In this particular case the estimate is updated if the test 
quantity 
g(t)’ Ppl) > 1—41) 


Figure 11.14 shows a simulation that is comparable to Fig. 11.12. Notice that 
the exponential growth is now avoided. The elements of the P-matrix remain 
bounded, and the estimator gains are well behaved. oO 


The selection of the condition for updating is critical. If the criterion is too 
stringent, the estimates will be poor because updating is done too infrequently. 
If the criterion is too liberal, we get covariance windup. 


Constant-Trace Algorithms 


Another way to keep the P-matrix bounded is to scale the matrix at each 
iteration. A popular scheme is to scale it in such a way that the trace of the 
matrix is constant. An additional refinement is to also add a small unit matrix. 
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Figure 11.14 Illustration of how estimator windup can be avoided with 
conditional updating. Compare with Fig. 11.12. 
This gives the so-called regularized constant-trace algorithm: 
A(t) = O(t ~ 1) + K (6) (v(t) — p” EE 1)) 


-1 
K(t) = P(t—1)o(t) (A+ (t)" P(t — Ye) 
si lfe P(t — PATAP- 1) (1121) 
P = — { Pat — 1) Ha + 
= 3 (Pe-0- “Torre ee) 
P(t) 
P i} = — ENN I 
O=a rpt? 
where c1 > 0 and c3 > 0. Typical values for the parameters can be 
cife = 104 
pp-a > 1 
The constant-trace algorithm may also be combined with conditional updating. 


Directional Forgetting 


Another way to forget old data is based on the fact that one observation gives 
a projection of the parameter on the regression vector. Exponential forgetting 
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can then be done only in the “direction” of the regression vector. This approach 
is called directional forgetting. To derive the equations, we observe that the 
inverse of the P-matrix with exponential forgetting is given by 


P(t +1) = AP) + 99" E) 
In directional forgetting we start with the formula 

P(t +1) = P(t) + oe" (6) 
The matrix P—1(t) is decomposed as 

P(t) = Pl + Yee? E) (11.22) 
where P~1(#)g(t) = 0. This gives 

o = POP Oot) 
(p7 eot)? 

Exponential forgetting is then applied only to the second term of Eq. (11.22), 


which corresponds to the direction where new information is obtained. This 
gives 


P(t 1) = P(t) + Arlote () + ee") 
which can be written as 


7@P()e() 
P(t + 1) = P(t) + (14-1 e) tho? (t 
Verny I O 
There are several variations of the algorithms. The forgetting factor is some- 
times made a function of the data. One method has the property that the 
P-matrix is driven toward a matrix proportional to the identity matrix when 
there is poor excitation. 


Leakage 


Another way to avoid estimator windup, called leakage, was discussed in Sec- 
tion 6.9. In continuous time the estimator was modified as shown in Eq. (6.84) 
by adding the term a@(6°—@). This means that the parameters will converge to 
6° when no useful information is obtained, that is, when e = 0. A similar mod- 
ification can also be made in discrete-time estimators. When a least-squares 
type of algorithm is used, it is also common to add a similar term to the P 
equation to drive it toward a specified matrix. 


Robust Estimation 


The least-squares estimate is optimal if the disturbances are Gaussian and 
such that the equation error is white noise. In practice the least-squares esti- 
mate has some drawbacks because the assumptions are violated. It is a direct 
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consequence of the least-squares formulation that a single large error will have 
a drastic influence on the result because the errors are squared in the crite- 
rion. This is a consequence of the Gaussian assumption that impes that the 
probability of large errors is very small. Estimators with very different prop- 
erties are obtained if it is assumed that the probability for large errors is not 
negligible. Without going into technicalities, we remark that the estimators 
will be replaced by equations such as l 


A(t) = 6(t — 1) + Plot — If (e(t) 
dô 


a = Perle) 


where the function f(e) is linear for small € but increases more slowly than 

linear for large £. A typical example is 
E 

f(e) = IF aje] 


The net effect is to decrease the consequences of large errors. The estimators 
are then called robust. 


11.6 SQUARE ROOT ALGORITHMS 


It is well known in numerical analysis that considerable accuracy may be lost 
when a least-squares problem is solved by forming and solving the normal 
equations. The reason is that the measured values are squared unnecessarily. 
The following procedure for solving the least-squares problem is much better 
conditioned numerically. Start with Eq. (2.4): 


£=-Y—oé 


An orthogonal transformation Q, that is, QTQ = QQ? = I, does not change 
the Euclidean norm of the error 


£=-QE=QY-—Q0e 
Choose the transformation Q so that Q% is upper triangular. The above equa- 


tion then becomes _ 
(e) - (3) - [7]. 

ej) (F 0 
where © is upper triangular. It then follows that the least-squares estimate 
is given by ; 

0 = 7! 

and the error is (é7)72@?. This way of computing the estimate is much more 
accurate than solving the normal equation, particularly if ||£|| < ||Y||. The 
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method based on orthogonal transformation is called a square root method 
because it works with & or the square root of 6? &. There are several numerical 
methods that can be used to find an orthogonal! transformation Q, for example, 
Householder transformations or the QR method. We will not discuss these 
methods further, because we are primarily interested in recursive methods. 


Representation of Conditional Mean Values 


Recursive square root methods can naturally be explained by using probabilistic 
arguments, Some preliminary results on conditional mean values for Gaussian 
random variables will first be developed. We can now show the following result. 


THEOREM 111 Conditional mean values and covariances 
Let the vectors x and y be jointly Gaussian random variables with mean values 


E (z) = (e) (11.23) 


= (J-E $] as 


where R,y = R}, Further assume that dim x = n and dim y = p. The condi- 
tional mean value of x, given y, is Gaussian with mean 


and covariance 


E (x|y) = my + RayRy* (y — my) (11.25) 
and covariance 
cov (x|y) = Ruy = Rx — RyRy Ryx (11.26) 
A nonnegative matrix R can be decomposed as 
T 
Pele n) (S ose J me 
where D, and D, are diagonal matrices and Ly is lower triangular. Then 
RR, = K (11.28) 
and 
Ray = PL,D,L7 (11.29) 
Proof: We first show that the vector z defined by 
z= x — My — Ray R, (y — my) (11.30) 


has zero mean, is independent of y, and has the covariance 


R: = R; — Ry R7" Rye (11.81) 
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The mean value is zero. Furthermore, 
Ez(y— my) =E {(@ ~ Mx)(y¥— my) — RyRy (y —my)(y — my) } 
= Kyy — RyRy Ry = 0 


The variables z and y are thus uncorrelated. Since they are Gaussian, they are 
also independent. It now follows that 


od bese) PP) 
x— My E RyRy’ I z 
The joint density function of x and y is 
f(x,y) = (27) ("PP (det R)" 
1 — 
exp {—5 (Rza + (y—my)" Fy = ms) } 


The density function of y is 


1 
f(0) = (ay (det R, exp { -30y — my) R; (y —m,)} 
where p is the dimension of y. The conditional density is then 


f(x,y) 
f) 
where n is the dimension of x. But 
R R R, R 
det R = det [ ” = ) = aet [ ana 4 
= det R, - det (Rx — RyRy Ryx) = det R, - det F, 


F(xly) = 


= (2x)-"/? (det R}? (det R)" exp 1-3 Rzt\ 


Hence 
f (xly) = (222)? (det R) ee Bes 


where z is given by Eq. (11.30) and R; by Eq. (11.31). 
The first part of the theorem is thus proved. To show the second part, 
notice that Eq. (11.27) is 


D, D,K? 
R=p T T 
KD, L,D,L, + KD,K 
Identification of the different terms gives 
Ry = pDy 
Ry = PKD, 
R = p(L,D,L? + KD,K*) 
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Hence 
RyRy’ =K 


and 


Rey = Rx — Ray R7 Rys = pLa DLT 


Remark. It follows from the theorem that the calculation of the conditional 
mean of a Gaussian random variable is equivalent to transforming the joint 
covariance matrix of the variables to the form of Eq. (11.27). Notice that this 
form may be viewed as a square root representation of R. m 


Application to Recursive Estimation 
The basic step in recursive estimation can be described as follows: Let @ be 
Gaussian N (0°, P). Assume that a linear observation 

y=¢070 +e 


is made, where e is normal N (0, c°). The new estimate is then given as the 
conditional mean E (8|y). The joint covariance matrix of y and @ is 


r= (9 pP) S a) 
[Pe P 0 0 
The symmetric nonnegative matrix P has a decomposition P = LDL’, where 


L is a lower triangular matrix with unit diagonal and D is a nonnegative 
diagonal matrix. The matrix F can then be written as 


_ | g? LDL’ g +0? QTLDLT 


LDLTg LDLT 
1 g™L) (0? 0 1 0 
= 11.32 
k LJL p) late zr) (11.32) 
If this matrix can be transformed to 
1 0 oe 0 1 KT 
R= _ - ` 11.88 
(x i} lo allo a] (11.88) 


Theorem 11.1 can be used to obtain the recursive estimate as 
6 = 69+ K (y— 78) 
with covariance 
P=LDL' 
The algorithm can thus be described as follows. 
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ALGORITHM 111 Square root RLS 


Step 1: Start with L and D as a representation of P. 


Step 2: Form the matrix of Eq. (11.82), where ø is the regression vector. 
Step 3: Reduce this to the lower triangular form of Eg. (11.33). 
Step 4: The updating gain is K, and the new P is represented by Land D. o 


It now remains to find the appropriate transformation matrices. A conve- 


nient method is dyadic decomposition. 


Dyadic Decomposition 


Given vectors 
a={1 a... an)” 
b=(b, ba ... bp) 
and scalars @ and $, find new vectors 
ā=(l1 dg än) 
b=(0 b ... 5,)* 
such that o 
aaa” + Bbb? = aaa? + BET (11.34) 


If this problem can be solved, we can perform the composition of Eq. (11.33) 


by repeated application of the method. 
Equation (11.34) can be written as 


1 bi 
ao ba 
aj. | ag... an)+B] . | (bi be 

Gn bn 
1 0 
ao ` ba 

= a . ( 1 ao wea ay } -+ B . 
a, bn 
Equating the (1,1) elements gives 
a+ pb = 


Equating the (1, k) elements for k > 1 gives 


æa, + Bbib, = dG; 


(11.35) 


(0 be ... bn) 


(11.36) 


(11.37) 
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Adding and subtracting £b?a; give 
(a + Bb?)a, + Bib, — Beñar = Aa, 
Hence 


b 
ad; = Qa + Po (br — biag) (11.88) 


The numbers & and 4; can thus be determined. It now remains to compute f 
and bz. Equating the (&,/) elements of Eg. (11.35) for k,l > 1 gives 
Aaka + Bbpb; = 4,4; + brb 


b bib) aes 
_ (aa, + B Pao +P 151) + Béxb, 


where Eq. (11.37) has been used to eliminate @,é;. Inserting the expression in 
Kg. (11.36) for & gives, after some calculations, 


ČP s> 
(by — biar} (b: — biar) = ap bp by 


PROCEDURE 
DyadicReduction(VAR a,b:col; VAR alpha, beta:REAL; 
i0,i1,i2 :CARDINAL); 


CONST 
mzero = 1.0E-i0; 
VAR 
i : CARDINAL; 
wi,w2,bi,gam : REAL; 
BEGIN 
IF beta<mzero THEN beta:=0.0:; END; 
bi := bid]; 
wi := alpha; 
w2 := beta*bi; 


alpha := alpha + w2*bi; 
IF alpha > mzero THEN 
beta := wi+beta/alpha; 
gam := w2/alpha; 
FOR i:=ii TO i2 DO 


bli] := bli] - biali]; 
ali] := a[i] + gam*b[i]; 
END; 


END; 
END DyadicReduction; 


Figure 11.15 Dyadic decomposition. 
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These equations have several solutions. A simple one is 


b; = by — bya; 
= ap 
P= a 


A solution to the dyadic decomposition problem of Eq. (11.34) is given by the 
equations 


&=a+t Boi 
mi ap 
Pave 
_ Boy 
= g 
by = by — biak k=2,...,n 
Gy, = ak + ybr k=2,...,n 


The algorithm in Fig. 11.15 is an implementation of the dyadic decomposition. 


PROCEDURE 

LDFilter(VAR theta,d:col; VAR l:matr; phi:col; 
lambda:REAL; n:CARDINAL) ; 

VAR 
i,j : CARDINAL; 
e,w : REAL; 


i=" 
co 
Oo 
ell 
1i 


lambda; 
e := phi[0]; 
FOR i:=1 TD n BO 
6:=e-theta Li] *phi [i] ; 
wi=phi [i]; 
FOR j:=iti TO n DO w:=w+philj]*1[i,j]; END; 
1(0,i} :=0.0; 
1[i,0] :=w; 
END; 
FOR i:=n TO i BY ~1 DO (+ Notice backward loop *) 
DyadicReduction(1 [0] ,1 [i] ,d{0] ,d[i] ,0,i,n); 
END; 


FOR i:=1 TO n DO 
theta [i] :=theta[i]+1(0,i] *e; 
d [i] :=d[i] /lambda; 
END; 
END LDFilter; 


Figure 11.16 LD decomposition. 
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In this code, the type 
col = ARRAY[O..maxindex] OF REAL; 


has been introduced. By using the procedure DyadicReduction it is now 
straightforward to write a procedure that implements Algorithm 11.1 Sucha 
procedure is given in Fig. 11.16. The algorithm performs one step of a recursive 
least-squares estimation. Starting from the current estimate @, the covariance 
represented by its LD decomposition, and the regression vector, the procedure 
generates updated values of the estimate and its covariance. The data type 


matr = ARRAY(O..maxindex] OF col; 


is used in the program. The starting values can be chosen to be L = J and 
d = [Bo, Bo,--- Bo]. This gives LD = Bol. 


11.7 INTERACTION OF ESTIMATION AND CONTROL 


Parameter estimation and control design were treated as two separate subjects 
in the previous sections of this chapter. In an adaptive controller there are, of 
course, strong interactions between estimation and control. Some consequences 
of this interaction are discussed in this section. 


Computational Delay 


The updating of the estimated parameters and the design are done at each 
sampling instant. The timing of computations of the controller was discussed 
in Section 11.2. We pointed out that it is important to have as short a compu- 
tational delay as possible. The dual time scale of the adaptive control problem 
implies that the process parameters are assumed to vary slowly. This means 
that the parameter estimates from the previous sampling instant can be used 
for calculating the control signal. There will thus be no extra time delay due to 
the adaptation, provided that the parameter update and the controller design 
are done after the control signal is sent out to the process. 


Integral Action 


Practically all controllers need integral action to ensure that calibration errors 
and load disturbances do not give steady-state errors. In Section 3.6 we showed 
how the design procedure could easily be modified to give controllers with 
integral action. In that section it was also shown that a particular adaptive 
controllers automatically gave zero steady-state error. This situation occurs 
quite frequently. It is also easy to check whether a particular self-tuner has 
this ability by investigating possible stationary solutions. A typical example is 
the following. 
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EXAMPLE 11.9 Obtaining integral action automatically 


Consider the simple direct moving-average self-tuning controller described in 
Chapter 4, which is based on least-squares estimation and minimum-variance 
control. The estimation is based on the model 


y(t +d) = R*(q)u(é) + S*(a™) (2) 
and the regulator is : 
u(i) = -F 90 


The conditions for a stationary solution are that 


N 
1 
Fy(t) = lim y 2 (k + t){k) = 0 t=d,...,d+l 
k=1 


N 
fult) = lim 5 Sie + r)u(k) sO 2000 ER 
k=1 


where & and / are the degrees of the R* and S* polynomials, respectively. These 
conditions are not satisfied unless the mean value of y is zero. When there is 
an offset, the parameter estimates will get values such that R*(1) = 0, that 
is, there is an integrator in the controller. However, the convergence to the 
integrator may be slow. o 


A second way to explicitly eliminate steady-state errors is to base an adap- 
tive controller on estimation of parameters in the model 


A(g)y(t) = B(aju (t) +v 


where v is a constant that is estimated. The control design should also be 
modified by introducing a feedforward from the estimated. disturbance. This 
approach has the drawback that an extra parameter has to be estimated. 
Furthermore, it is necessary to have different forgetting factors on the bias 
estimate and the other estimates; otherwise, the convergence to a new level 
will be very slow. Finally, if the bias is estimated in this way, it is not possible 
to use the self-tuner as a tuner, since there will be no reset when the estimation 
is switched off. This is a simple example that shows the drawbacks of mixing 
the functions of the feedback loop and the adaptation loop. A much better way 
is to design a controller with integral action, for example, by using the methods 
discussed in Section 3.6. A data filter of band-pass character should also be 
used so that the disturbance v does not influence estimation too much. We will 
also show how a similar approach can be used for a direct self-tuner. 


Compatible Criteria for Identification and Control 


So far, we have treated identification and contro] as two different tasks. The 
criterion for the identification (least squares) was chosen largely on an ad 
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hoc basis. It is clearly desirable to try to find a criterion for identification 
that matches the final use of the model. This is in general a very complicated 
problem. We therefore discuss a simplified case. Consider a process described 
by the model 
A(a)y(t) = B(a)u(é) (11.39) 

where u is the control signal and y is the measured variable. Let the controller 
be 

R(q)u(t) = T (g)ue (t) — S(9) y(t) (11.40) 
where u, is the setpoint and R(¢), S(g), and T(q)} are polynomials. The poly- 
nomials R(g) and S(g) satisfy the Diophantine equation 


A(Q)R(q) + B(a)S (g) = An(@)Ao(@) (11.41) 


where the desired closed-loop polynomial is As(q)Am(q). This equation has 
many solutions. It is customary to choose the simplest one that gives a causal 
controller, but it is also possible to introduce an auxiliary condition. Integral 
action is obtained by finding a solution such that R(1) = 0. High-frequency 
roll-off is obtained by requiring that S(—1) = 0. 

The polynomial T(q) is given by 


T (g) = teAd(q) (11.42) 


where to = Am (1)/B(1). If R(i) = 0, it also follows that T(1) = S(1). Combin- 
ing Eqs. (11.39) and (11.40), we get 


Bq) 


y(t) = to —= u(t) 
An (q) (11.43) 
_, Aq) 
u(t) = to Anla e(t) 


Polynomials A,(q) and Am(q) are typically chosen to give good rejection of 
disturbances and insensitivity to modeling errors and measurement noise. 

It is desirable to formulate the adaptive control problem in such a way that 
the goals for control and identification are compatible. If this is done, it means 
that a model is fitted in such a way that it matches the ultimate use of of the 
model. 

Consider the situation in which the goal is to control a plant with transfer 
function Po. A controller is designed by using pole placement based on the 
approximate model whose transfer function is P = B/A. To compute the control 
law, the parameters of the polynomials A and B are estimated by using least 
squares, and the controller is then determined by the pole placement method. 
Let uo and yo denote the inputs and outputs that are obtained in controlling 
the actual plant, and let u and y denote the corresponding signals when the 
controller controls the design model. The control performance error can then 
be defined as 

Eep = Yo — Y 
We have the following result. 
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THEOREM 112 Compatibility of identification and control 
The control performance error e,, is identical to the least-squares estimation 
error if identification is performed in closed loop and if the transfer function 
of the data filter is chosen to be 

R 


Proof: The proof is a straightforward calculation. The output of the true 
system is given by 


PoT 
yo = RiPS Ue (11.45) 
and the control signal is 
T 


The corresponding signals for the nominal] plant are obtained simply by omit- 
ting the index 0 on yo, uo, and Po. The control performance error then becomes 
_ r= (one Pot PT aps) te = RT (Po — P) 
R+P)S R+PS ~ (R + PoS)(R + Ps) “° 
_ R(Po — P) in = AR(Po — P) 
T R+PS ° ASA, 
where the first equality follows from Eqs. (11.48), (11.45), and (11.46). The 
second equality is obtained by a simple algebraic manipulation. The third 
follows from Eq. (11.46), and the last equality follows from Eq. (11.41). The 
least-squares estimation error is given by 


€ = H {Ayo —_ Buo) 
It follows from Eq. (11.47) that e and e,, are identical if estimation is based on 
closed-loop data and if the data filter is chosen to be Eq. (11.44). m 
Remark I. Notice that the denominator of the filter (11.44) is given by A,An, 
which are given by the specifications. 


Remark 2. Notice that for a controller with integral action the filter (11.44) 
is a bandpass filter. 


Remark 3. Notice that only the numerator of the filter has to be adapted. O 


This result gives a rational way of choosing the data filter for a servo 
problem. 


(11.47) 


11.8 PROTOTYPE ALGORITHMS 


In this section we present some prototype algorithms for adaptive control. 
Guidelines for the coding of the algorithms are given. The algorithms can easily 
be expanded to a variety of controllers. 
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Algorithm Skeleton 


All adaptive algorithms discussed in this chapter have the following form: 
1 Analog Digital_conversion 


2 Compute_control_signal 

3 Digital_Analog_conversion 

4 If estimate then 

5 begin{estimate} 

6 Covariance_update 

7 Parameter_update 

8 If tune then 

9 begin{tune} 
th_design:=th_estimated 

10 Design_calculations 

11 end{tune} 

12 end{estimate} 

13 Organize_data 

14 Compute_as_mnuch_as_possible_of_control_signal 


Row 1 implements the conversion of the measured output signal, the ref- 
erence signal, and possible feedforward signal. All the converted signals are 
supposed to be filtered through appropriate anti-aliasing filters, as discussed 
in Section 11.2. Row 3 sets the control signal to the process. Rows 14 and 2 con- 
tain the calculations of the control signal, which are independent of whether 
the parameters are estimated or not. Notice the division of the calculations 
of the control signal to avoid overly long computation times. Al! calculations 
that are possible to do in advance are done in Row 14, Only calculations that 
contain the last measurements are done in Row 2. 

Rows 4-13 contain calculations that are specific for an adaptive algorithm. 
There are two logical variables, estimate and tune, which control whether 
the parameters are going to be estimated and whether the controller is going 
to be redesigned, respectively. The estimation is done in Rows 5-7, and the 
design calculations are done in Row 10. Row 13 organizes the data such that 
the algorithm is always ready to start estimation when the operator wishes. 

The various adaptive algorithms discussed in this section differ only in the 
design calculations. The estimator part can be the same for all algorithms. One 
important part of the algorithms that will not be discussed here is the operator 
interface. This is usually a significant part of an adaptive control system, but 
it is very hardware-dependent, so it is difficult to discuss in general terms. We 
now discuss the calculations in Rows 4—13 in more detail. 


Parameter_update 


We assume that the estimated model has the form 
y(t) = o7 (6 
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where the components in the regression vector g are lagged and filtered inputs 
and outputs. The ordering, the number of lags, and so on depend on the specific 
model; these details are easily sorted out for the chosen model structure. Rows 
6 and 13 of the algorithm contain the bookkeeping of the g vector (i.e., the usual 
shift of some parts of the vector and supplement of the latest measurements 
and outputs). This part of the algorithm should also include the data filtering 
discussed in Sections 11.2, 11.8, and 11.7. For simplicity it is assumed that 
the estimation and the covariance update are done by using ordinary recursive 
least squares (Eqs. 11.19). The calculations can be organized as in the listing 
below, where eps is the residual, th_estimated is the parameter vector, P 
is the covariance matrix, phi is the data vector, and lambda is the forgetting 
factor. 

"Compute residual 

eps = y - phi’*th_estimated 

"Update estimate 

w = P*phi 

den = lambda + phi’*w 

gain = w/den 

th_estimated = th_estimated + gain*eps 

“Update covariance 

P = (P - ww’ /den)/lambda 
The prime is the transpose, and + is matrix multiplication. This skeleton can 
easily be transferred to any preferred programming language. 


Organize_data 


This part of the code filters the process input and output by Hp, and it updates 
the regression vector g(t) and the other states of the system. If g(t) is updated 
at each sampling period, it is possible to update the estimates irregularly. 


Design_calculations 


When a direct algorithm such as Algorithm 3.3 is used, the controller param- 
eters are the same as the estimated parameters, and there are no calculations 
that have to be done in the design block. In the indirect methods a polyno- 
mial equation has to be solved. The solution of the Diophantine equation is 
discussed in Section 11.4. Some care must be taken because of difficulties with 
possible common factors in the estimated model polynomials. 


Compute_control_signal 


The computation of the control signal to minimize the computational delay was 
discussed in Section 11.2, along with the anti-reset windup. 
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Summary 


The program skeleton in this section can now be supplemented with details 
to become a complete adaptive control algorithm. These details will depend 
on what algorithm is chosen to be implemented and on which programming 
language is chosen. 


11.9 OPERATIONAL ISSUES 


Simple controllers typically have two operating modes, manual and automatic. 
It is also possible to change the parameters during operation. It is a nontrivial 
task to deal with the operation of a conventional controller. The current practice 
has developed over a long period of time. Adaptive controllers can operate 
in many more ways. It is a difficult problem to find a good solution to the 
operational problems. The problem will also vary widely with the application 
area. In this section we discuss a controller for industrial process control, which 
is designed to operate in many widely different environments. 


Operating Modes 


An adaptive controller has at least three operating modes: manual, constant- 
parameter control, and adaptation. The controllers that are used in constant- 
parameter mode may be of several types: PID, relay, or a general linear con- 
troller. In this mode the controller parameters must also be loaded and stored. 
Parameter estimation may also be initiated in the constant-parameter mode. 
Estimation can also be enhanced by introducing extra perturbations. The na- 
ture of these perturbations must also be specified. 


Initialization 

There are several ways to initialize a selftuning algorithm, depending on the 
available a priori information about the process. In one case, nothing is known 
about the process. The initia] values of the parameters in the estimator can 
then be chosen to be zero or such that the initial controller is a proportional 
or integral controller with low gain. Auto-tuning, discussed in Chapter 8, is 
a convenient way to initialize the algorithm, because it generates a suitable 
input signal and safe initial values of the parameters. This also gives a rational 
way of choosing the sampling interval. 

The inputs and outputs of the process should be scaled so that they are 
of the same magnitude. This will improve the numerical conditions in the 
estimation and the control parts of the algorithm. The initial value of the 
covariance matrix can be 1-100 times a unit matrix if the elements in the 9 
vector are scaled to approximately unity. These values are usually not crucial, 
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since the estimator will get reasonable values in a very short pericd of time. 
Our experience is that 10-50 samples are sufficient to get a very good controller 
when the system is excited. During the initial phase it can be advantageous to 
add a perturbation signal to speed up the convergence of the estimator. 

The situation is different if the process has been controlled before with a 
conventional or an adaptive controller. The initial values should then be such 
that they correspond to the controller used before. Furthermore, the P-matrix 
should be sufficiently small. 

Sometimes it is important to have disturbances be as small as possible, ow- 
ing to the startup of the self-tuning algorithm. There are then two precautions 
that can be taken. First, the estimator can be used for some sampling peri- 
ods before the self-tuning algorithm is allowed to put owt any control actions. 
During that time a safe, simple controller should be used. It is also possible 
and desirable to limit the control signal. The allowable magnitude can be very 
small during the first period of time and can then be increased when better 
parameter estimates are obtained. The drawback of having small input signals 
is that the excitation of the process will be poor, and it will take longer to get 
good parameter estimates. 


Supervision 


An adaptive controller should also contain facilities for supervision. Basic 
statistics such as the mean, standard deviation, and maximum and minimum 
values should be computed for the process input and output. These values 
should be averaged over the basic period of the loop. Since excitation is so impor- 
tant, it should be monitored. The estimation error also gives useful information 
about the behavior of the loop. Common factors in the process model should 
be detected. This will indicate that the model structure should be changed. 
For special algorithms it is also possible to determine whether the controller 
behaves as expected. For example, for minimum variance or moving average 
controllers this can be determined by monitoring the covariance of the process 
output. 


11.10 CONCLUSIONS 


Practical aspects on implementation of adaptive controllers have been dis- 
cussed in this chapter. There are many things to consider, since adaptive con- 
trollers are quite complicated devices. The following are some of the important 
issues: 


e Analog anti-aliasing filters must be used. They are typically second- or 
fourth-order filters that effectively eliminate signal components with fre- 
quencies above the Nyquist frequency 7/h, where h is the sampling period. 
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Figure 11.17 Block diagram of an adaptive control system with added fil- 
ters. Ga is the anti-aliasing filter, G,, is the postsampling filter, and Hp is 
the data filter for the estimation. 


The dynamics of the filters should be taken into account in the control de- 
sign. 

Inputs and outputs should be filtered by a bandpass filter with Hp before 
these signals are sent to the parameter estimator. These filters will remove 
low-frequency disturbances such as levels and ramps. High frequency dis- 
turbances are also removed by Hp. The lower limit of the passband should 
be at least one decade below the desired crossover frequency. Known sinu- 
soidals can also be removed by using notch filters. 


The postsampling filter Gpp is used to avoid excitation of high-frequency 
resonance modes in the process. 


Low-order models are typically used. They are estimated with algorithms 
having time variable exponential forgetting, regularized constant trace, 
or directional forgetting. The estimator should also contain a dead zone. 
Finally, the estimator may contain a “switch,” which detects whether the 
system is sufficiently excited. The “switch” can measure the power in 
different frequency bands and thus control whether the estimator should 
be active or not. Square root algorithms are preferable, particularly if there 
is a high signal to noise ratio. 

The design method for the controller should be robust against unmodeled 
dynamics. Level and ramp disturbances are eliminated by introducing 
integrators in the controller. The control signal should be limited, and 
the controller should include anti-reset windup. 


A block diagram of a reasonably realistic adaptive controller is given in 


Fig. 


11.17, Adaptive controllers also contain parameters. Guidelines for choos- 


ing these have been given in this chapter, 
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PROBLEMS 


i114 


11.2 


11.3 


11.4 


11.5 


11.6 


11.7 


11.8 


How should the disturbance annihilation filter Hp(q) be chosen if v(t) 
in Eq. (11.17) is a sinusoidal? 


Consider the data filter 
Hpg) = 


Discuss how the choice of the parameter œ influences elimination of 
constant disturbances. 


(l-@)G~9) 
q-a 


Plot the Bode diagram for a fourth-order Bessel filter, and compare it 
with a pure time delay. Consider the cases in Table 11.2. 


Determine how the behavior of the anti-reset windup controller of Eqs. 
(11.1) is influenced by the filter A,. 


Complete the algorithm skeleton for the cases of 
(a) a direct self-tuner (Algorithm 3.3). 
(b) an indirect self-tuner without zero cancellation (Algorithm 3.2). 


Perform a transformation of a second- and fourth-order Bessel filter, 
ær = 1, into band-pass filters, using the transformation 


s+ Oro, 


s> ———~ 
s(@,, — w) 


where @; and ©, are the lower and upper cutoff frequencies, respec- 
tively. Use @; = 100 and @, = 1000 rad/s. Compare the band-pass 
characteristics by using Bode diagrams. 


Use Euclid’s algorithm to compute the greatest common divisor of 
A= q’ — 2g + 1.45g — 0.35 
B =œ —11¢+0.3 
Also determine the polynomials X and Y in Eq. (11.8). 
Consider the Diophantine equation 
AR + BS = Á, 


and let 

A(@) = (g — 1)(g — 0.9) 
Use the method in Section 11.4, and compute the resulting controller 
when (a) B(q) = q — 0.6; (b) B(q} = q — 0.9. Assume that the desired 
closed characteristic polynomial is 


Amla) =0°— g + 0.7 
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11.9 One way to solve the Dicphantine equation is to multiply it by a persis- 
tently exciting signal, such as white noise. Introduce the filtered signals 


Va (t) = a v(z) valt) = —_ v(t) 


The Diophantine equation then becomes 
Ru, + Suz = v 


The coefficients of the R and S polynomials can now be determined by 
using the method of least squares, and one iteration can be done at each 
sampling instance. Discuss the merits and drawbacks of this approach. 
(Hint: What is the convergence rate?) 
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CHAPTER 12 


COMMERCIAL PRODUCTS 
AND APPLICATIONS 


12.1 INFRODUCTION 


There have been a large number of applications of adaptive feedback control 
over the past 30 years. Experiments with adaptive flight-control systems were 
done in 1960. Industrial experiments with self-tuning regulators were per- 
formed in 1972. Full-scale experiments with adaptive autopilots for ship steer- 
ing were done in 1973. Special adaptive systems have been in continuous use 
for a long time. Some process control loops have been running continuously 
since 1974. There are also a number of special products that have been op- 
erating for a long time. Commercial systems for ship steering have been in 
continuous operation since 1980. 

Systems implemented by using minicomputers appeared in the early 1970s. 
However, not until the 1980s did adaptive techniques start to have real impact 
on industry. The number of applications increased drastically with the advent 
of the microprocessor, which made the technology cost-effective. Because of 
this, adaptive controllers are also entering the marketplace even in single-loop 
controllers. Several commercial products based on adaptive techniques were 
introduced in the early 1980s, and second- and third-generation versions have 
been introduced in some cases. 

Adaptive techniques are used in a number of products. Gain scheduling is 
the standard method for design of flight control systems for high-performance 
aircraft, and it is also used in robotics and process control. The self-oscillating 
adaptive system is used in several missiles. There are several commercial adap- 
tive systems for ship steering, motor drives, and industrial robots, Adaptive 
techniques are used both in single-loop controllers and in general-purpose pro- 
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cess control systems in the process industry. Most industrial processes are 
controlled by PID controllers, and a large industrial plant may have thousands 
of them. Many instrument engineers and plant personnel are used to select, in- 
stall, and operate such controllers. In spite of this, many controllers are poorly 
tuned. One reason is that simple, robust methods for automatic tuning have 
not been available. Adaptive methods are now available for automatic tuning of 
PID controllers. This is in fact one of the fastest-growing areas of application 
for adaptive control. 

However, adaptive techniques are still not widely used; the technology is not 
mature. Because of involvement of commercial enterprises in adaptive control, 
it is not always possible to find out precisely what is being done. Various ideas 
are hidden in proprietary information that is carefully guarded. 

This chapter is organized as follows. An overview of some applications is 
given in Section 12.2, A number of commercial products that use adaptation are 
presented in Sections 12.3 and 12.4. Some specific applications are presented 
in more detail in the sections that follow. Ship steering, automobiles, and 
ultrafiltration are areas given special attention. 


12.2 STATUS OF APPLICATIONS 


Alarge number of experiments with adaptive control have been performed since 
the mid-1950s. The experiments have had different purposes: to verify ideas, to 
find out how adaptive systems perform, to compare different approaches, and 
to find out when they are suitable. The early experiments, which used ana- 
log implementations, were plagued by hardware problems. When digital pro- 
cess computers became available, they were natural tools for experimentation. 
Experiments with adaptive control required substantial programming, since 
adaptation was not part of the standard software. Applications proliferated 
with the advent of the microprocessor, which is a convenient tool for imple- 
menting adaptive systems. Adaptive techniques now appear both in single-loop 
controllers and as standard elements of large process control systems. There 
are tailor-made controllers for special purposes that use adaptive techniques. 


Feasibility Studies 


A number of feasibility studies have been performed to evaluate the useful- 
ness of adaptive control. They cover a wide range of control problems, such 
as autopilots for missiles, ships, and aircraft; engine control; motion control; 
machine tools; industrial robots; power systems; distillation columns; chemical 
reactors; pH control; furnaces; heating; and ventilation. There are also applica- 
tions in the biomedical area, The feasibility studies have shown that there are 
cases in which adaptive control is very useful and others in which the benefits 
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are marginal, Some industrial products also use adaptive techniques. There 
are both general-purpose controllers and controllers for special applications. 


Auto-tuning 


Simple controllers with two or three parameters can be tuned manually if there 
is not too much interaction between the adjustments of different parameters, 
but manual tuning is not possible for more complex controllers. Traditionally, 
tuning of complex controllers has taken the route of modeling or identification 
and controller design. This is often a time-consuming and costly procedure, 
which can be applied only to important loops or to systems that are to be 
manufactured in large quantities. 

All adaptive techniques can be used to provide automatic tuning. In such 
applications the adaptation loop is simply switched on. Perturbation signals 
may be added to improve the parameter estimation. The adaptive controller is 
run until the performance is satisfactory; then the adaptation loop is discon- 
nected, and the system is left running with fixed controller parameters. The 
particular methods for automatic tuning of PID controllers that were discussed 
in Chapter 8 have been found to be particularly attractive because they require 
little prior information and are closely related to standard industrial practice. 

Auto-tuning can be considered a convenient way to incorporate automatic 
modeling and design in a controller, It simplifies the use of the controller, 
and it widens the class of problems in which systematic design methods can 
be used cost-effectively. This is particularly useful for design methods such as 
feedforward that depend critically on good models. 

Automatic tuning can be applied to simple PID controllers as well as to 
more complicated systems. It is very convenient to introduce tuning into a 
DDC package because the tuning algorithm can serve many loops. Auto-tuning 
can also be included in single-loop controllers. For example, it is possible to 
obtain standard controllers in which the mode switch has three positions: 
manual, automatic, and tuning. A well-designed auto-tuner is very easy to 
use, even for unskilled personnel. Experience has shown it to be useful both 
for commissioning of new systems and for routine maintenance. Auto-tuners 
can also be used to enhance the skill of the instrument engineers. Automatic 
tuning will probably also be a useful feature of more complicated controllers. 


Automatic Construction of Gain Schedules 


Gain scheduling is a very useful technique, but it has the drawback that it 
may be quite time- and cost-consuming to build a schedule. Auto-tuning can 
conveniently be used to build gain schedules. A scheduling variable is first 
determined. The parameters that are obtained. when the system is running in 
one operating condition are then stored in a table together with the scheduling 
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variable. The gain schedule is obtained when the process has operated at a 
variety of operating conditions that covers the operating range. 


True Adaptive Control 


The adaptive techniques may, of course, also be used for genuine adaptive 
control of systems with time-varying parameters. There are many ways to do 
this. The operator interface is important, since adaptive controllers also have 
parameters that must be chosen. Controllers without any externally adjusted 
parameters can be designed for specific applications, in which the purpose of 
control can be stated a priori. The ship steering autopilot discussed in Sec- 
tion 12.6 is a typical example. In many cases, however, it is not possible to 
specify the purpose of control a priort. It is at least necessary to tell the con- 
troller what it is expected to do. This can be done by introducing dials that give 
the desired properties of the closed-loop system. Such dials are characterized 
as performance-relaied. New types of controllers can be designed by using this 
concept. For example, it is possible to have a controller with one dial, labeled 
with the desired closed-loop bandwidth. Another possibility would be to have a 
controller with a dial that is labeled with the weighting between state deviation 
and control action in an LQG problem. Adaptation can also be combined with 
gain scheduling. A gain schedule can be used to get the parameters quickly 
into the correct region, and adaptation can then be used for fine-tuning. 


Adaptive Feedforward 


In many applications it is possible to measure some of the disturbances acting 
on the process. Feedforward control is very useful when there are measur- 
able disturbances. With feedforward it is possible to decrease the influence of 
disturbances substantially. However, feedforward control, being an open-loop 
compensation, requires good models of process dynamics. Identification and 
adaptation therefore appear to be prerequisites for effective use of feedforward 
compensation, Until now, very little research and development have been done 
on adaptive feedforward, even if it was used in the early applications of self- 
tuning regulators. 


Abuses of Adaptive Control 


An adaptive controller is more complex than a fixed-gain controller, since it is 
nonlinear. Before we attempt to use an adaptive controller, it may therefore be 
useful to investigate whether the problem can be solved with a robust constant- 
gain controller, as discussed in Chapter 10. As was pointed out in Chapter 1, it 
is not possible to judge the need for adaptation from the variations in the open- 
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loop dynamics. The open-loop responses may vary much while the closed-loop 
responses are close and vice versa. 

The complexity of the controller has to be balanced against the engineering 
effort required to make the system operational. Experience has shown that only 
a modest effort is required to make a standard adaptive system work well. 


12.3 > INDUSTRIAL ADAPTIVE CONTROLLERS 


A number of industrial products incorporate adaptive control techniques. The 
products can be divided into 


e Tuning tools for standard controllers, 

« Adaptive standard process controllers, 

« General-purpose toolboxes for adaptive control, and 
+ Special-purpose adaptive controllers. 


Because of the large number of different products, it is possible to give only 
some examples from the different categories. 


Tuners for Standard Process Controllers 


There are many products for tuning of standard controllers of PID type. Leeds 
and Northrup announced a PID controller with a self-tuning option in 1981. 
SattControl in Sweden announced auto-tuning for PID controllers in a small 
DDC system in 1984 and a single-loop controller with auto-tuning in 1986. 
Practically all PID controllers that come on the market today have some kind 
of built-in automatic tuning or adaptation. There are four main solutions for 
the tuners for standard controllers: 


+ A parametric model approach, 

e A nonparametric model approach, 

e External tuning devices, and 

e Tuning tools in distributed control systems. 


The main idea in the parametric model controllers is to make an experiment, 
usually in open loop, and estimate a first- or second-order model with time 
delay. The input signals are usually steps, but pulses or pseudo-random bi- 
nary sequence (PRBS) signals are also used. The parameters of a PI or PID 
controller are then determined by using empirical tuning rules or a pole place- 
ment technique. Typical products in this category are Protonic from Hartman 
& Braun and UDC 6000 from Honeywell. 

In the nonparametric model approach, a point on the Nyquist curve is gen- 
erally estimated by using relay feedback. Compare the auto-tuning discussed 
in Chapter 8. On the basis of this information a modified set of Ziegler-Nichols 
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tuning rules are used to determine the parameters of the controller. SattCon- 
trol ECA40 and Fisher-Rosemount DPR900 are typical of this category. 

The tuning aids discussed above are built-in features in the standard 
controllers. The operator initiates tuning by pushmg a button or giving a 
command, The external tuning tools are special types of equipment that are 
connected to the process for the tuning or commissioning and then removed. 
The experiments are usually done with the process in open loop. The external 
tuner then determines suitable controller parameters. The new parameters are 
often entered manually by the operator. Since the external tuner can be used 
for different types of standard controllers, it must have detailed knowledge 
about the parameterization and implementation of algorithms from different 
manufacturers. Examples of external tuning tools are Supertuner from Toyo 
Systems in Japan, Protuner from Techmation in Arizona, PIDWIZ from BST 
Control in Illinois, and SIEPID from Siemens in Germany. 

Tuning tools have also been introduced in distributed control systems. Be- 
cause of the available computing power, it is possible to have very good human- 
machine interfaces and several options for tuning. Honeywell has a system 
called Looptune; Fisher-Rosemount Systems has a product called Intelligent 
Tuner. 


Adaptive Standard Process Controllers 


The tuners discussed above do not tune the controllers continuously but only on 
demand from the operator. However, there are also standard controllers with 
adaptation, which can follow changes in the parameters of the process. The 
adaptive standard controllers can be divided into 

ə» A parametric model approach, 

e A nonparametric model approach, and 

e A pattern recognition approach. 


The model-based adaptive controller usually estimates a first- or second-order 
model with time delay using a recursive least-squares algorithm. A pole place- 
ment controller with PID structure can then be determined. Examples are the 
Bailey Controls CLC04 and Yokogawa SLPC-I81, -281. 

One example of a nonparametric adaptive controller is SattControl ECA 
400. (See Fig. 1.23.) It is a development of the relay-based auto-tuner. One point 
of the Nyquist curve is estimated continuously by using band-pass filtering. 
The parameters of the controller are then determined by using a modified 
version of the Ziegler-Nichols tuning rules. 

Expert systems or pattern recognition have also been used for adaptive 
tuning of standard controllers. The first was the Foxboro EXACT, which was 
announced in October 1984. This controller is described in more detail in 
the text that follows. In 1987, Yokogawa announced adaptive PID controllers, 
SLPC-171 and SLPC-271, which have features similar to those of Foxboro’s 
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EXACT. Another controller in this category is Fenwal 570. The Honeywell 
UDC 6000 controller uses step response analysis for automatic tuning and 
a rule base for adaptation. These controllers are designed to capture the skill 
of an experienced control engineer in rules. About 100-200 rules are typically 
implemented. The controllers are waiting for changes in the reference value or 
large upsets of the process. On the basis of the response and the tuning rules, 
the parameters of the controller are modified to increase the performance of 
the closed-loop system. 

Several of the adaptive standard controllers, for example, Fisher DPR 910 
and SattControl ECA400, have adaptive feedforward and the possibility to build 
up gain scheduling tables automatically. These features are very useful and can 
improve the performance considerably. 

Standard controllers with more sophisticated control algorithms are now 
appearing on the market. One example is U.A.C. (Universal Adaptive Con- 
troller) from Process Automation Systems in British Columbia, which is based 
on predictive control. The controller can also handle multivariable systems. 


General-Purpose Toolboxes for Adaptive Control 


There is often a need to use more elaborate control algorithms than the stan- 
dard PID controllers. It is then necessary to estimate higher-order models and 
to have the possibility to use different design algorithms. To cover these situa- 
tions, general toolboxes for adaptive control have been developed. The adaptive 
algorithms are usually modules or blocks in more general packages for direct 
digital control (DDC). Asea Brown Boveri presented a general-purpose adap- 
tive controller in 1982. First Control Systems in Sweden introduced an adaptive 
controller in 1986. It is also possible to implement adaptive control in modern 
distributed control systems. 


PLC Implementations 


Adaptive controllers can also be implemented in ordinary programmable logic 
controller (PLC) systems. Such solutions are used by manufacturing companies 
with competent in-house expertise. For example, 3M has implemented adaptive 
controllers in this way. The first installation was made in 1987. Currently, 
there are about 200 adaptive loops in operation, A wide range of processes 
are controlled. The systems are implemented on a variety of platforms such as 
General Electric, Modicon, Measurex, Square-D, and Reliance. Programming 
is done in Basic or C. The applications include standard loops for temperature, 
pressure, position, and humidity and more specialized loops associated with 
3M proprietary processes. The adaptive algorithms that are used are based on 
estimation of parameters in models having the structure 


A` (qy) = Bi (g> )ulé — d) + B3 (quot — d) 
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where v is a measurable disturbance. Polynomial A* has degree one or two, 
but polynomials Bj and Bj may have higher degree to cope with variable 
time delay. The parameters are estimated by a special gradient technique. The 
control design is a modified minimum-variance strategy. 


Special-Purpose Adaptive Controllers 


For many processes, extensive process knowledge is available. To make good 
control, it is advantageous to use as much a priori knowledge as possible. 
Structures of the model and knowledge of integrators or time constants can be 
used to design the controller and to facilitate the tuning. For instance, special- 
purpose adaptive controllers have been developed for ships, pulp digesters, 
motor drives, ultrafiltration, and cement raw material mixing. 


12.4 SOME INDUSTRIAL ADAPTIVE CONTROLLERS 


Some representative commercial products and their features are described in 
this section. Special emphasis is put on properties such as estimation, prior 
information, and industrial experiences. The section ends with a discussion of 
some general aspects of industrial use of adaptive controllers. 


SattControl ECA40 and Fisher Control DPR 900 


This is the original auto-tuner based on relay oscillations, as described in 
Chapter 8. It was first introduced in a small (about 45 loops) DDC system 
for industrial process control SDM20. In this application the tuner can be 
connected to tune any loop in the system. Relay auto-tuning is also available in 
single-loop PID controllers (SattControl ECA40 and Fisher Control DPR900). 
In these controllers, tuning is done on demand by pushing a button on the 
front panel, so-called one-button tuning. The controllers are also provided with 
facilities for gain scheduling. There is a table with three controller settings. 


Parameter Estimation. The ultimate period and the ultimate gain are deter- 
mined by an experiment with relay feedback. The fluctuations in the output 
signal are measured, and the hysteresis of the relay is set slightly wider than 
the noise band. The initial relay amplitude is fixed. The amplitude and period 
are measured for each half-period. A feedback adjusts the relay amplitude so 
that the limit cycle oscillation has a given amplitude. When two successive half- 
periods are sufficiently close, PID parameters are computed, and PID control 
is initiated automatically. 

Control Design. When the ultimate gain and the ultimate period are known, 
the parameters of a PID controller can be determined by a modified Ziegler- 
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Nichols rule. There is also a limited amount of logic to determine whether 
derivative action is needed. 


Prior Information. A major advantage of the auto-tuner is that no parameters 
have to be set a priori. To use the tuner, the process is simply brought to an 
equilibrium by setting a constant control signal in manual mode. The tuning 
is then activated by pushing the tuning button. The controller is automatically 
switched to automatic mode when the tuning is complete. Different control 
objectives may be obtained by modifying the parameters in the Ziegler-Nichols 
rule. One mode is chosen by default, but the user can request a slower or an 
extra-fast response. 


Industrial Experiences. The system has been considered very easy to use, even 
by inexperienced personnel. Both the auto-tuning and gain-scheduling features 
have been found to be very useful. In many applications the auto-tuner has con- 
tributed significantly to improved tuning. It has also been demonstrated that 
commissioning time can be shortened significantly by using automatic tuning 
and that the standard controller can be applied to processes having a wide 
range of time scales. Simplicity is the major advantage of the auto-tuner. This 
has proved particularly useful for plants that do not have qualified instrument 
engineers and for operation during the night shift, when instrument engineers 
are not available. It is also easy to explain the auto-tuner to the instrument 
engineers. The properties of the auto-tuner are illustrated by an example. 


EXAMPLE 12.1 Level control 
Figure 12.1 shows the behavior of the controller when it is used to control the 


level of a vessel in a pulp mill. A controller with pure proportional action was 
used originally, resulting in the steady-state error shown in the figure. The 


Tuning Output level 
130 s 


P function PI function Setpoint change of 4% 


Control signal 


Figure 12.1 Results obtained when using the SattControl ECA40 for level 
control in a pulp mill. 
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tuning took about two minutes and resulted in a PI controller. This example 
illustrates the usefulness of the logic for selecting control action. Figure 12.1 
also shows the control signal. o 


EXACT: The Foxboro Adaptive Controller 


This controller is based on analysis of the transient response of the closed- 
loop system to setpoint changes or load disturbances and traditional tuning 
methods of the Ziegler-Nichols type. 


Parameter Estimation. Assuming controller parameters such that the closed- 
loop system is stable, a typical response of the control error to a step or impulse 
disturbance is shown in Fig. 12.2. Heuristic logic is used to detect that a proper 
disturbance has occurred and to detect the peaks e1, e2, and eg and period T,. 
The estimation process is simple, but it is based on the assumption that the 
disturbances are steps or short pulses. The algorithm can give wrong estimates 
if the disturbances are two short pulses because 7, will then be estimated to 
be the distance between the pulses. 


Control Design. The control design is based on specifications on damping, 
overshoot, and the ratios 7;/T, and Tz/Tp, where T; is the integration time, 
Ta is the derivative time, and Tp is the period of oscillation. The damping is 
defined as 


and the overshoot as 


In typical cases, both d and e must be less than 0.3. Empirical rules are used to 
calculate the controller parameters from Tp, d, and o. These rules are based‘on 
traditional tuning rules of the Ziegler-Nichols type, augmented by experiences 
from controller tuning. 


Figure 12.2 Typical response of control error to step or impulse distur- 
bances. 
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Prior Information. The tuning procedure requires prior information about the 
controller parameters Ke, T;, and Ty. It also requires information on the time 
scale of the process. This is used to determine the maximum time the heuristic 
logic waits for the second peak. Some measure of the process noise is also 
needed to set the tolerances in the heuristic logic. Some parameters may also be 
set optionally: damping d, overshoot o, maximum derivative gain, and bounds 
on the controller parameters. 


Pre-tuning. The tuning procedure requires reasonable controller parameters 
to be known so that a closed-loop system with a well-damped response is 
obtained. There is a pre-tune mode that can be used if the prior information 
that is needed is not available. A step test is done in which the user specifies 
the step size. Initial estimates of the controller parameters are determined 
from the step, and the time scale and the noise level are also determined. The 
pre-tune mode can be invoked only when the process is in steady state. 


Industrial Experiences, Thousands of units of EXACT controllers are in use 
today. The system is also available in Foxboro’s system for distributed process 
control. Users from a large number of installations have reported favorably, 
citing the ease with which controllers can be well tuned and the ability to 
shorten commissioning time. It is also mentioned that derivative action can 
often yield significant benefits. 


Eurotherm Temperature Controller 


Temperature control is traditionally done with simple PID controllers, which 
are cheaper than conventiona) industrial controllers. Auto-tuning is now also 
used in such simple systems. One example is controllers produced by Eu- 
rotherm in the United Kingdom. A modified relay tuning is used in those 
controllers. Full control power is used until an artificial setpoint is reached. 
Two half-periods of a relay tuning are then used, and the controller param- 
eters are calculated from the transient. The controller also has facilities for 
automatic on-line tuning based on transient response analysis. 

In temperature control loops, there are usually different dynamics depend- 
ing on whether the temperature is increasing or decreasing. This nonlinearity 
can be handled by using gain scheduling. 


Asea Brown Boveri (ABB) Adaptive Controller 


The Asea Brown Boveri (ABB) adaptive controller was first marketed under 
the name Novatune. It is an adaptive controller that is incorporated as a 
part of ABB Master, a distributed system for process control. The system is 
block-oriented, which means that the process engineer creates a system by 
selecting and interconnecting blocks of different types. The system has blocks 
for conventional PID control, logic, and computation. Three different blocks, 


510 Chapter 12 Commercial Products and Applications 


ON LOAD AUTO REG- SOFT 
AD 


ON AUTO 


UEXT 
Self- 
FB tuning U 
adaptive 
per TEY ator 


PY MAX MIN T 


PL PU PY MAX MIN TPN NANB NC ED INT 


Figure 12.3 Block diagrams of the adaptive modules STAR1 and STAR3, 
available in the ABB adaptive controller. 


called STARI, STAR2, and STAR3, are adaptive controllers. The adaptive 
controllers are self-tuning regulators based on least-squares estimation and 
minimum-variance control. The controllers all use the same algorithm; they 
differ in the controller complexity and the prior information that must be 
supplied in using them. 

The ABB adaptive controller differs from the controllers that were dis- 
cussed previously in that it is not based on the PID structure. Instead, its al- 
gorithm is based on a general pulse transfer function. It also admits dead-time 
compensation and feedforward control. The ABB adaptive controller system 
may be viewed as a toolbox for solving control problems. 


Principle. The ABB adaptive controller is a direct self-tuning regulator simi- 
lar to Algorithm 4.1 in Section 4.3. The parameters of a discrete-tume model are 
estimated by using recursive least squares. The control design is a minimum- 
variance controller, which is extended to admit positioning of one pole and a 
penalty on the control signal. The block diagrams in Fig. 12.8 show two of the 
adaptive modules. The ABB adaptive controller system has three adaptive mod- 
ules: STARI, STAR2, and STARS. STARS is the most complicated. The simpler 
ones have fewer inputs and have default values on some of the parameters in 
STAR3. In the block diagram the input signals are shown on the left and top 
sides of the box, the output signals on the right, and the parameters on the 
bottom. The parameters can be changed at configuration time. The parameters 


12.4 Some Industrial Adaptive Controllers 511 


PL, T, and PN can also be changed on-line. 

The simplest module, STAR1, has three input signals: the manual input 
UEXT, the measured value FB, and the setpoint REF, It has three parameters. 
The variable PY is the smallest relevant change in the feedback signal; the 
adaptation is inhibited for changes less than PY. The parameters MAX and 
MIN denote the bounds on the control variable, and T is the sampling period. 

The module STAR2 has more input signals. It admits a feedforward signal 
FF. There are also four signals, HI, LO, DH, and DL, that admit dynamic 
changes on the bounds of the control variable and its rate of change. There are 
also additional parameters: PN, for a penalty on the control variable, and KD, 
which specifies the prediction horizon. The module also has two additional 
mode switches: REGAD, which turns off adaptation when false, and SOFT, 
which allows a soft start. 

The module STAR3 has an additional function LOAD, which admits pa- 
rameters stored in an EEPROM to be loaded. It also has several additional 
parameters, which admit positioning of one pole PL and specification of con- 
troller structure NA, NB, NC, and INT. 


Parameter Estimation. The parameter estimation is based on the model 
(1 — PLq™)y(¢ + KD) — (1 — PL)y(¢) 
= A*(q”")Ay(t) + B*(g™)Au(t) + C*(q™)Av(?) 


where A’, B*, and C* are polynomials in the delay operator g@', y is the 
measured variable, u is the contro] signal, v is a feedforward signal, and A is 
the difference operator 1—q™. (Compare with Algorithm 3.6.) The integers NA, 
NB, and NC give the number of coefficients in the polynomials A*, B*, and C*, 
respectively. The number PL is the desired pole location for the optional] pole. 
When parameter INT is zero, a similar model without differences is used. The 
parameters are estimated by using recursive least squares with a forgetting 
factor A = 0.98. Parameter estimation is suspended automatically when the 
changes in the control signal and the process output are less than PU and PY. 
The parameter updating may also be suspended on demand through the switch 
REGAD., In combination with other modules in the ABB adaptive controller 
system, this constitutes a convenient way to obtain robust estimation. 


Control Design. The control law is given by 


(p + B(q7*)) Au(t) = (1 — PL) (ue(t) — y) — A*a Ayt) — C (g>) Ale) 
where p is a penalty factor related to PN. Since the algorithm is a direct 


self-tuner, the controller parameters are obtained directly from the estimated 
parameters. 


Industrial Experiences. The ABB adaptive controller has been applied to a 
wide range of process control problems in the steel, pulp, paper, and petro- 
chemical industries, wastewater treatment, and climate control. Some appli- 
cations have given spectacular improvement of performance compared to PID 
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control. This is particularly the case for processes with time delay, and in ap- 
plications in which adaptive feedforward can be used. It Has also been used 
to make special-purpose systems for special application areas such as paper 
winding and climate control. Some ABB adaptive controller applications are 
described in more detail in Section 12.5. The essential drawback of the ABB 
adaptive controller is that it is based on a direct self-tuner. This means that 
the sampling period and the parameter KD have to be chosen with care. It 
may, for example, be difficult to use very short sampling periods. 


Firstloop: The First Control Adaptive Controller 


The adaptive system Firstloop was developed by First Control Systems, a 
small company founded by members of the Novatune team. Firstloop is a 
small controller module with up to eight self-tuning regulators. The system 
is a toolbox with modules for adaptive control, logic, filtering square root 
functions, and operator communication. An interesting feature is that the 
adaptive controller is the only controller available in the system. However, by 
choosing the number of parameters of the estimated model, it is possible to 
get different controller structures—for instance, a PID controller. The adaptive 
controller can tune ten parameters with a sampling period of 20-50 ms. The 
software admits easy configuration of a control system. The First Control 
controller is shown in Fig. 12.4. Firstline is a distributed process control system 
with a block-oriented language for control design. The adaptive controller is 
incorporated as a standard function module. We will describe the adaptive 
control module in detail. 


Principle. The adaptive control unit used in Firstloop and Firstline is based 
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Figure 12,4 The MicroController from First Control. (With courtesy of First 
Control Systems AB.) 
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oD recursive estimation of a transfer function model and a control law based 
on indirect pole placement. The controller also admits feedforward. The main 
advantage of using an indirect pole placement algorithm is that the system can 
be applied to nonminimum-phase systems and systems with time-varying time 
delays. This also implies that short sampling periods can be used. (Compare 
the discussion in Section 6.9.) The adaptive module comés in two versions, 
a standard module and an expert module. The standard module is intended 
for use by ordinary instrument engineers who are not specialists in adaptive 
control. The expert module shown in Fig. 12.5 is intended for specialists in 
adaptive control. Many parameters are given default values in the standard 
module. The variables that must be specified are shown in Fig. 12.5. The signal 
connections are measured value MV, setpoint SP, external control signal UE, 
feedforward FF1, FF2, and controller output U. The mode switches ON, AUTO, 
and ADAPT are for on/off, auto/normal, and adaptation on/off, respectively. 
Parameters UMAX and UMIN define the actuator range. Variables HI, LO, 
DUP, and DUM specify the limits on the control signal that are used internally 
in the controller. The performance-related parameters are POLE, which gives 
the desired closed-loop pole, and BMPLVL, which gives the admissible initial 
change of the contro] variable at mode switches. 
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Figure 12.5 The expert module STREGX in Firstloop. 
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The desired closed-loop pole is the major variable to be selected. The choice 
of this variable clearly requires knowledge of the time scales of the process. 
The recommended rule of thumb is to start with a large value and gradually 
decrease it. 


Parameter Estimation. The parameters of a transfer function model are esti- 
mated. Systems with variable time delay can be captured, provided that a large 
number of b parameters are used. Up to 15 parameters can be estimated in 
the modei. The number of parameters in the model is specified by NA, NB, and 
NC. Common factors in the pulse transfer function are canceled automatically. 


Control Design. The control design is based on pole placement. The desired 
response is characterized as a first-order system with delay. The remaining 
poles are positioned at the origin. The design of the algorithm is based on 
solving the Diophantine equation by a method that cancels common factors 
in the estimated polynomials. An LQG-based algorithm is also available. The 
details of the control design are proprietary. 


Safety Network. The algorithm is provided with extensive safety logic. Adap- 
tation is interrupted when variations in measured signals and control signals 
are too small. The limits are given with the parameters RESU and RESY. 
Adaptation is also interrupted when the control error is below a certain limit, 
and there are safeguards to ensure that the influence of a single measurement 
error or sudden large disturbance is limited. (Compare Section 6.9.) Measured 
values that result in large model errors are also given a low weight automati- 
cally. The details of the safety logic are not available. Different models can be 
stored for use in different situations. The controller is initialized by a model 
number equal to MODEL when LOAD changes from false to true. 


Industrial Experiences. Firstloop and Firstline are used in a number of high- 
performance process control systems. They include control of pulp mills, paper 
machines, rolling mills, and pilot plants for chemical process control. 


Discussion 


The products described give an idea of how adaptive techniques are used 
in commercial products. Additional insight can be derived by analyzing the 
existing products and trends. Experience from the applications clearly indicates 
the need for tuning and adaptation; there are undoubtedly many control loops 
that are poorly tuned. This results in loss of energy, quality, and effective 
production time. It is also of interest that many different techniques are used, 
and there are also promising adaptive algorithms that have not yet reached 
the marketplace. A few specific issues will be discussed in more detail. 


Computing Power. Industrial use of adaptive methods has been possible be- 
cause of the availability of microprocessors. Most of the commercial systems 
are based on 8-bit processors, with their inherent limitations in addressing 
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capability. This applies to all the PID auto-tuners and the first version of the 
ABB adaptive controller that used less than 64 kbyte of memory. With 16-bit 
processors and larger address spaces it is possible to use more sophisticated 
algorithms and better human-machine communication. The PID auto-tuners 
typically run with sampling rates of 10-50 Hz. 


Intentional Perturbation Signals. To estimate parameters, it is necessary to 
have data with variations in the control signal. Such variations can be gen- 
erated naturally or introduced intentionally. Natural perturbations can occur 
because of disturbances or poorly tuned controllers. Intentional perturbations 
can be introduced when natural perturbations are not present, as suggested by 
dual control theory. This method is used in several of the auto-tuning schemes. 
If prior information about the system dynamics is available, it is possible to 
find signals that are optimal for the purpose of estimating parameters. Relay 
feedback automatically generates an input signal! having a lot of energy at the 
frequency at which the process has a phase lag of 180°. Although intentional 
perturbation signals are both useful and justified by theory, they are often 
controversial. It should be remembered, however, that poorly tuned controllers 
may also be considered perturbations. 


Controller Structures. Different controller structures are used in the commer- 
cial systems. There are both PID controllers and general transfer function 
systems that admit feedforward and compensation for dead time. The main 
advantage of the PID structure is that it is close to current industrial practice. 
Within the PID family there are cases in which derivative action is of little 
benefit. Systems like the SattControl ECA40 can determine this and choose PI 
action automatically. However, there is no system that can choose the controller 
structure generally, although it seems possible to design such systems. 

The benefits of feedforward control from measurable disturbances have 
been known for a long time. Experience with the ABB adaptive controller 
and Firstloop clearly shows the benefit of adaptive feedforward control. Since 
feedforward control critically depends on a good model, adaptation is almost a 
prerequisite for feedforward control. Adaptive controllers like the ABB adaptive 
controller and Firstloop use a controller structure that is a general transfer 
function model like 


R(q)u(é) = Ti(q)u.(t) + Te(q)v(t) — S(a) (2) (12.1) 


where u is the control variable, u, is the command signal, v is a measured 
disturbance, and y is the controlled output. The polynomials R, S, Tı, and Tz 
can be chosen so that the controller corresponds to a PID controller. However, 
the controller modeled by Eq. (12.1) can also be much more general than a 
PID controller. It can incorporate many classical features such as filtering, 
disturbance models, Smith predictors, and notch filters. For more demanding 
control problems the general transfer function controller thus has significant 
advantages over the PID controller. However, more expertise in control engi- 
neering is needed to understand and interpret the parameters of a controller 
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like Eq. (12.1). Since the PID controller is so common, we can expect it to 
coexist with more general controllers for a long time. 


Multivariable Control. Multivariable control problems can be handled to a 
limited extent by using the feedforward feature in the ABB adaptive controller 
and Firstloop. None of the commercial systems admit truly multivariable adap- 
tive control. Up to now there have not been many applications of adaptive con- 
trol to true multivariable systems. This situation can be expected to change 
significantly because of the substantial interest in model predictive control. 


Pre-tuning. It is interesting to note that many schemes have been provided 
with a pre-tuning feature. In some cases it appears that this was added after- 
wards. The reason is undoubtedly that too much user expertise is required for 
the standard algorithms. The selection of sampling periods or the equivalent 
time scales is a typical example. It appears that the relay method for automatic 
tuning would be an ideal method for pre-tuning. 


Tuning Automatically or on Demand. The existing products include systems 
in which tuning is initialized on demand from the operator or automatically. 
Users of both schemes have documented their experiences. It appears that 
there are a number of processes for which controllers should be retuned for 
different operating conditions. In many cases there are measurable signals 
that correlate well with the operating conditions. In these cases it seems 
that the combination of on-demand automatic tuning with gain scheduling 
is a good solution. This will give systems that change parameters faster than 
systems with adaptation. Of course, it is convenient to have tuning initiated. 
automatically, but it is difficult to give general guidelines for when tuning 
should be initiated. The simple schemes that are currently in use are often 
based on simple level detection. Further research is required to find conditions 
for retuning; this is discussed further in Section 13.4. 

An analysis of the division of labor between human and machine gives 
another viewpoint on the question of on-demand or automatic tuning. When 
tuning is done on demand of the operator, the ultimate responsibility for tuning 
clearly remains with the operator or the instrument engineer. This responsibil- 
ity is carried even further in some systems, in which the instrument engineer 
has to acknowledge the tuned values before they are used. A good solution 
would be a system in which the responsibility and the tuning techniques could 
be moved from the operator to the computer system. Ideally, the system should 
also allow the operator to learn more about control in general and the particular 
process in question. Experimental architectures that allow this are available, 
but not in commercial systems. 


Requirements for the User 


The requirements for the user are very different for the various commercial 
systems. The PID controllers in which tuning is initiated automatically require 
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very little. Controllers with on-demand tuning require somewhat more knowl- 
edge on the part of the user. Systems such as the ABB adaptive controller and 
Firstloop can be regarded as toolboxes for solving control problems that are 
more demanding. They also allow complex control systems to be configured. 
This is clearly illustrated by the experiences from ABB adaptive controller in- 
stallations, The system was designed by a very qualified team that included 
several first-rate Ph.D.s. The design team was also responsible for many of the 
initial installations, which were extremely successful. 

More recent versions of the toolbox systems are much easier to use. 
Moderate-sized systems have been successfully implemented by instrument 
engineers with little knowledge of advanced control. There are several reasons 
for the increased user-friendliness of the systems. The safety logic has been 
improved significantly; modules in which many parameters are given default 
values have been designed; and computer-based configuration tools, with a lot 
of knowledge built in, have been developed. The toolboxes thus allow a user to 
get started quickly with a modest knowledge of adaptive control, and they also 
make it possible for a user to construct more advanced systems when more 
knowledge is acquired. 


12.5 PROCESS CONTROL 


There are many applications of adaptive control in the field of industrial process 
control. Some typical examples are discussed in this section. The applications 
give insight into how adaptive control can be used in practice. 


Temperature Control in a Distillation Column 


Although the SattControl auto-tuner has been used mostly for conventional 
loops for control of flow, pressure, and level, it has also been applied to more 
difficult problems. One example is temperature control in a distillation col- 
umn, This is a conventional control loop in which the temperature in a tray of 
a distillation column is measured and the boil-up is manipulated. This control 
loop was part of a process system with many loops. There had been severe 
problems with the temperature control for a long time, and several attempts 
had been made to tune the loop. Figure 12.6 shows a recording of the temper- 
ature. The figure shows that the loop is oscillatory with the controller tuning 
that was used (K, = 8, T; = 2000, and Tz = 0). Also notice the long period 
of the oscillation. The controller was switched to manual at time 11:30, and 
the temperature then started to drift. Auto-tuning was initiated at time 14:00. 
The tuning phase was completed after six hours at time 20:00, when the con- 
troller was automatically set to automatic control. The controller parameters 
obtained were K, = 1.3, T; = 4800, and Ta = 1100. Notice that the whole 
tuning procedure is fully automatic. The only action taken by the operator was 
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Figure 12.6 Application of the SattControl ECA40 to temperature control 
in a distillation column. 


to initiate tuning at time 14:00. The temperature variations during tuning are 
not larger than those obtained with the conventional controller settings. The 
example shows that the auto-tuner can cope with a process having drastically 
different time scales than those normally used. l 


Chemical Reactor Control 


Chemical reactors are typically nonlinear. Characteristics such as catalyst ac- 
tivity change with time, as does the raw material. There are often inherent 
time delays, which may vary with production level. Poor control can result in 
lower product quality, damage to the catalyst, or even explosions in exothermic 
reactors, Chemical reactors are therefore potential candidates for adaptive con- 
trol. The process in this application consists of two parallel chemical reactors 
in which ethylene oxide is produced by catalytic oxidation. of ethylene. The pro- 
cess is exothermic and time-variable because of changes in catalyst activity. 
It is essential to keep the temperature accurately controlled; a reduction of 
temperature variations improves the yield and prolongs the life of the catalyst. 
Stable steady-state operation is also a first step toward plant optimization. 
The plant was equipped with a conventional control system that used 
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Figure 12.7 Schematic diagram of the reactor. 


PID controllers to control flow and temperature. The plant personnel were 
dissatisfied with the system because it was necessary to switch the controllers 
to manual control in case of many major disturbances, which could happen 
several times per day. 

A schematic diagram of the process is shown in Fig. 12.7. The reactor is 
cooled by circulating oil to a cooler. The temperature of the coolant at the 
inlet to the reactor is the primary controlled variable, and the reactor outlet 
temperature and the coolant flow are also measured. The control signal is 
the flow to the cooler. The dynamics relating temperatures and flow to valve 
openings have variable delays and gains. 

Disturbances in the process are caused by variations in the incoming gas 
and load changes. Large disturbances occur with changes in production level or 
with “shutdowns” caused by failure in surrounding process equipment. During 
shutdowns it is most important to maintain the process temperature as long as 
possible so that the production can be restarted easily. With the conventional 
control system, temperature fluctuations were around +0.5°C during normal 
operation and up to +2°C during larger disturbances. With adaptive control 
the variations were reduced to +0.1°C during normal operation and +0.5°C 
during large upsets. 

The adaptive control system was implemented by using the ABB adaptive 
controller and the STAR3 module with feedforward from the reactor outlet 
temperature. By using the other modules in the system, it was also straight- 
forward to handle the dual valves and to reset to manual mode for startup and 
shutdown. The system has been in continuous operation since 1982 on a reac- 
tor at Berol Kemi AB, which produces 30,000 tons per year. The operational 
experiences with the system have been very good. With adaptive control, it was 
possible to reduce the temperature fluctuations significantly. The controllers 
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are now kept in automatic mode most of the time, even during production 
changes. This has made it possible to revise operational procedures, since op- 
erators do not have to spend their time supervising the reactor temperature. 


Pulp Dryer Control 


Drying processes are common in the process industries. The mechanisms in- 
volved in drying are complex and poorly understood, and their dynamics depend 
on many changing factors. There are often significant benefits in improved reg- 
ulation, since an even moisture content is an important quality factor. There 
are also significant potential energy savings. Drying processes are thus good 
candidates for adaptive control. 

In pulp drying, a wet pulp sheet passes a steam-heated drying section 
and cooling section. A typical system is shown schematically in Fig. 12.8. The 
moisture content of the sheet entering the dryer is about 55%. At the exit, it 
is typically 10-20%. It takes about nine minutes to pass the dryer and about 
half a minute to pass the cooler. The dryer dynamics are complicated. It is 
influenced by many factors, such as the pH of the sheet. The measurements of 
the moisture content are obtained by a traversing microwave sensor that moves 
back and forth across the pulp sheet, describing a diagonal pattern on the sheet. 
When one traverse movement is complete, the mean value of the diagonal is 
stored in the computer, the mean value algorithm is reset, the sensor moves 
back, and the procedure repeats itself. It takes a little less than one minute 
for the sensor to move across the sheet. With manual control, the fluctuations 
in moisture content often exceed +1%. The ABB adaptive controller was used 
in this application. The control system configuration is shown in Fig. 12.8. 
The moisture control is carried out by an adaptive software controller STAR. 
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Figure 12.8 Schematic diagram of pulp drying and the control system. 
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The moisture content measured by the traversing system is low-pass filtered 
and connected to the FB input of the STAR. The desired moisture content 
is chosen by the operator outside the ABB adaptive controller and software 
connected to the REF input of the STAR. The pH value, measured in an earlier 
process section, is used as the feedforward signal. This signal is connected to 
the FF input of the STAR. The control signal of the STAR defines the desired 
steam pressure, which is measured and controlled to the desired value by a 
conventional hardware PI controller. The control signal of this controller acts 
continuously on the steam flow valve. 

The sampling period used in the adaptive controller was 3.5 minutes. A 
fourth-order Butterworth filter was used as an anti-aliasing filter. This was 
implemented by using the ABB adaptive controller tools. When the production 
rate was changed, large upsets were noticed, lasting for about 30 minutes, 
because it took 5-15 samples for the adaptive controller to settle. It was highly 
desirable to reduce these upsets, and this was done by introducing a special 
production rate compensation in the form of a pulse transfer function of the 


type 


z—a 
This gives a rapid change of the steam pressure when pulp speed changes. 
It was not necessary to make this filter adaptive. The system has been in 
operation since 1983 at a pulp mill at Mérrum’s Bruk that produces 330,000 
tons of paper pulp per year. The operational experiences have been very good. 
Fluctuations in moisture content have been reduced from 1% to 0.2%, which 
improves quality. It also allows the setpoint to be moved closer to the target 
value, resulting in significant energy savings. 


Control of a Rolling Mill 


The process control applications are typical steady-state regulation problems. 
The rolling mill control problem is much more batch-oriented. It illustrates the 
use of adaptive techniques in machine control, There are many types of rolling 
mills, each with its specific control problem. This particular application deals 
with a skin pass mill located at the end of the production line. The material 
processed by the mill may vary significantly in dimension and hardness. 

The purpose of the mill is to influence quality variables such as hardness 
and yield limit. A schematic diagram of the process is shown in Fig. 12.9. Let 
vy be the speed of the strip entering the mill, and let va be the speed of the 
strip at the exit. Because of the thickness reduction, the exit speed is larger 


than the entrance speed. The elongation is defined as 
t= vo — 04 
v1 


The key control problem is to keep a constant elongation. There is a difficult 
measurement problem, since the velocity difference is so small. The process 
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Figure 12.9 Schematic diagram of the rolling mill and the control system. 


operates over a wide range of conditions; the following operating modes can be 
distinguished: 


e Slow rolling at low speed during startup, 

ə Acceleration to fast rolling, 

e Fast rolling at production speed, 

» Intermediate decelerations to slow rolling or even to standstill, 
« Deceleration to slow rolling at the end of the strip, and 

« System at rest waiting for the next strip. 


Transition from one mode to another is performed automatically on demand 
from the operator. It is essential that the control system handle these tran- 
sitions well. The process dynamics relating elongation to roll force can be de- 
scribed as a high-order dynamical system with an open-loop response time of 
less than 0.05 s. Changes in production rate from 0 to 2000 m/min in less 
than 10 s are typical. The dynamics change drastically during the operation; 
the dynamics of rolling change because of variations in the speed, hardness, 
and dimension of the strip. There are also significant changes of the inertia of 
the coilers. All material starts on one coiler and ends up on the other. There 
are variations in the oil film on the roller bearings due to variations in speed 
and pressure. The dynamics of the hydraulic system vary with the operating 
point. 

The changes in dynamics due to changing speed are predictable and can 
(in principle) be taken care of by gain scheduling. Variations in dimension can 
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be handled similarly. The hardness cannot be measured directly on-line, so it 
must be handled by feedback and adaptation. 

The ABB adaptive controller was used in this application. A block diagram 
of the control system is shown in Fig. 12.9. The speed variations are taken care 
of in an elegant way. In the ABB adaptive controller, sampling can be triggered 
by an arbitrary signal. In this case it is triggered by the pulse counters that 
measure strip speed. This means that sampling is related to the length of the 
strip, not to time. This is a simple way of making the control system invariant 
to strip speed (the same idea was used in the ship steering example in Section 
9.5). The measurement of the velocity difference is implemented by using pulse 
generators and counters. 

For each strip a saved model is loaded into the controller, and the adapta- 
tion is switched on with some delayed action (15 sampling intervals) to avoid 
adaptation during the first few steps, in which the measurement is irregular. 
The initial model is taken from a soft strip so that there will be no excessive 
control action at startup. Soon enough, the controller will adapt to the condi- 
tions of the new strip. Figure 12.10 illustrates a typical run of a strip. Notice in 
particular how well the system copes with the velocity variations and with the 
mode changes. The installation of the system took about a week, mostly devoted 
to function and signal checking and tests. The controller functioned almost im- 
mediately when connected to the process. After that, approximately two days 
were devoted to checking and tuning performance. This involved experiments 
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Figure 12.10 Elongation, roll force, and strip speed during a typical run 
with the system. 
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Figure 12.11 The cold rolling mill at Avesta-Sheffield is controlled by First 
Control’s adaptive system. (With courtesy of Avesta-Sheffield, precision strip 
AB, Kloster.) 


with different sampling rates. 

A significant part of the installation time also involved other parts of 
the system, particularly the logic. Operational experiences with the adaptive 
control system have been very favorable. The variation in elongation was better 
than is found with a conventional system, and the adaptive system also settled 
faster during mode switches. The system has been in continuous operation 
since 1983. 

Figure 12.11 shows a cold rolling mill at Avesta-Sheffield in Långshyttan, 
Sweden. The process is controlled by First Control’s adaptive control system 
since 1990. The adaptive regulators keep the deviations in the strip thickness 
within 2-3 um, which is considered to be very accurate for this kind of mill. 


Pulp Digester 


Control of the pulp digester is an important part in manufacturing of chemical 
pulp. The raw material is wood chips, which are broken down into fibers by 
processing in a liquor composed of sodium hydroxide and sodium sulfide (white 
liquor). The process operates either in batches or, more commonly today, as a 
continuous process. 

The Kamyr digester (see Fig. 12.12) is the standard continuous process. 
The production rate is determined by the chip meter, which feeds chips into the 
top of the digester. The flow of pulp from the digester is controlled by the blow 
flow at the bottom. The digester has three zones: impregnation, cooking, and 
washing. The dynamics that describe the material transport and the chemistry 
in the digester is very complicated. The total residence time in the digester is 
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Figure 12.12 Schematic diagram of the chip level controller for a continuous 
Kamyr digester. 


about 5 hours. An important control problem is the control of the chip level, 
which is controlled by the blow flow. The chip level signal is calculated from 
three strain gauges by using a scheme developed by MoDo Chemetics. The 
study reported here is a feasibility study made by Pulp and Paper Research 
Institute of Canada (Paprican) and the pulp company MacMillan Bloedel in 
Vancouver. The study has resulted in an adaptive controller for digester control 
developed in cooperation between MoDo Chemetics in Vancouver and Paprican. 
The commercial adaptive controller manipulates two inputs (blow flow and chip 
meter) as indicated in Fig. 12.12; in the feasibility study, only the blow flow 
was manipulated by the adaptive controller. 

The industrial digester in the study produced 350 tons per day of kraft pulp. 
Two grades, R and K, are manufactured. From identification experiments it 
was found that the digester can be described by the model 


(1+ aq Ay = (bo + big™ + bag) Au(t — 2) + (14 erg + cog *)e(t) 


where Ay(t) is the difference in level, Au(t) is the change in blow flow, and 
e(t) is white noise. The sampling period is 5 minutes. The identification ex- 
periments also indicated that it would be feasible to use fixed values of all 
the parameters except the 6,’s. The adaptive controller will thus be able to 
compensate for gain changes and changes in the time delay of the process. A 
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Figure 12.13 Autocovariance of the chip level under conventional and adap- 
tive control. (With courtesy of Paprican.) 


GPC algorithm with N, = 1, N, = 1, and Ne = 15-20 is used (see Eq. 4.61). 
Figure 12.13 shows the autocovariance of the level when conventional PID 
control and adaptive control were used. The chip level signal is essentially un- 
correlated after three lags (15 minutes). The standard deviation of the level 
decreased from 11.3% to 8.6%. This improvement of the chip level leads to di- 
rect improvements in pulp quality. Table 12.1 shows permanganate number 
(P-number), which is a standard laboratory test of the residual lignin in the 
pulp. The P-number is closely related to the kappa number. The P-numbers 
were measured on samples from the blow flow collected once every two hours. 
For both grades (R and K) the average P-numbers are closer to the target 
values, and the standard deviations are reduced. 
In summary, the advantages of the adaptive controller are 


a Reduction of chip level and P-number variability, 

e Reduced need for operator intervention, 

« Elimination of manual retuning, and 

e Prediction of potential problems with hang-ups in the chip column. 


The pulp digester study is an example of a special-purpose adaptive controller. 
The process model is tailored to fit the specific application, and the parameters 
can be related to physical parts of the system. 

Pulp digesters have also been controlled by standard adaptive controllers. 
One example is the Vallvik mill at Assi Domän in Sweden where Novatune 
controllers in an ABB Master system are used extensively. Several Novatunes 
are used to control temperatures, flows and levels. The system was installed 
and commisioned by the regular mill staff with a core team of two enthusiastic 
engineers. The critical parameters in the Novatune were the sampling period 
and the prediction horizon; these values had to be selected individually for each 
application. Default values were used for the other parameters. The prediction 
sampling period is typically chosen to be 60% to 90% of the dead time, the 


12.6 Automobile Control §27 


Table 12.1 P-number variability under conventional and adaptive chip level 
control. (With courtesy of Paprican.)} 


Controller Grade Setpoint Mean Std. dev. No days 


Conventional R 23.0 22.2 2.06 23 
K 21.0 19.9 1.91 ` 6 

Adaptive R 23.0 22.4 1.76 18 
K 21.0 20.6 1.70 q 


predictions horizon is chosen as KD = 2 and the controller complexity as 
NA = NB = NC = 8. The standard procedure is to run the controllers in 
manual mode. The parameter estimation is switched on with restrictions on 
the control action, which are gradually removed. 

The experience with adaptive control has been very good. Control perfor- 
mance is significantly better with adaptive control than with PID control. The 
systems have not required much attention after installation. The reason for 
improved performance is that tighter control is obtamed with adaptive control. 
Experiments at the plant indicated that there was a good correlation between 
variations in chip level and the kappa-number. By introducing adaptive control 
of the chip level it was also possible to significantly reduce the variation in the 
kappa-number. The standard deviation was reduced from 0.52 to 0.30. It has 
also been observed that the adaptive controllers recover much faster from large 
upsets than the systems used previously. 


12.6 AUTOMOBILE CONTROL 


Microprocessor-based engine control systems were introduced in the automo- 
tive industry in the 1970s to address the demands of increased fuel economy 
and reduced emissions. Early electronic control systems had modest applica- 
tion. Today, the powertrain computer accomplishes a multitude of control tasks, 
including vehicle speed or “cruise” control, idle speed regulation, automatic 
transmission shift actuation, control of various emission-related systems, fuel 
control, and ignition timing, as well as many diagnostic functions. These highly 
I/O intensive systems must be cost effective and function acceptably in many 
thousands of vehicles with attendant manufacturing variability over a wide 
range of operating conditions. 

Many of the control functions in automobiles are open-loop look-up table 
oriented. Some automatic calibration methods have been developed to optimize 
table entries with respect to fuel economy, constrained by emissions. Typical 
closed-loop structures are comprised of individual operational loops, often PI 
or PID, and may contain several feedforward paths that are designed to re- 
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Figure 12.14 This Ford Mustang has state-of-the-art adaptive power train 
controls. (With courtesy of Ford Motor Company.) 


ject measurable or predictable disturbances. Applications of adaptive control 
concepts can be found in many of those powertrain control functions where on- 
line self-tuning techniques are used to adjust controller parameters (in most 
cases, the feedforward parameters) to compensate for component and operat- 
ing condition variability. One such adaptive control structure is the air-fuel 
ratio control method introduced by Ford in the mid-1980s to reduce sensitivity 
to component variability and calibration inaccuracy. Figure 12.14 shows a car 
with adaptive power train control. 

Modern automobiles require precise control of air-fuel ratio to attain 
high catalytic converter efficiency and minimize tailpipe emissions. Air-fuel 
ratio control has two principal components: a closed-loop portion in which 
the fuel injectors are regulated in response to a signal fed back through a 
digital PI controller from an exhaust gas oxygen sensor located in the engine 
exhaust stream, and an open-loop or feedforward portion in which fuel flow 
is controlled in response to an estimate of the air charge entering the engine. 
(Compare Section 9.5.) This open-loop portion of the control is particularly 
important during engine transient when the inherent delay of the engine 
and exhaust system obviate the effectiveness of feedback, and during cold 
engine operation before the exhaust gas oxygen sensor has reached operational 
temperature. The purpose of the adaptive algorithm is to adjust the open-loop 
feedforward gain to reduce deviation from stoichiometric air-fuel ratio operation 
and improve emission performance under open- and closed-loop operation. This 
is essentially a gain scheduling process in which an adaptive multiplier is 
stored in a look-up table as a function of engine speed and load. Initially, all 
the table entries are unity. As the engine operates throughout its range, the 
appropriate cell values are increased or decreased to correct for parametric 
changes or inaccuracies in the initial calibration. In contrast to typical gain 
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scheduling techniques, this adaptation is continuous throughout the vehicle’s 
life. 

Another application of adaptive control at Ford is in the area of auto- 
motive speed control or “cruise” control, These systems must provide accept- 
able steady-state error, excellent disturbance rejection, unnoticeable throt- 
tle movement, and must be robust to vehicle-to-vehicle variability and op- 
erating condition. An adaptive control design based on sensitivity analysis 
and gradient methods has been used to continuously tune the gains of a 
PI controller. This was accomplished by constructing a single quadratic cost 
function and adjusting the proportional and integral control gains to min- 
imize this function. Additional modifications, such as projection and dead- 
band together with slow adaptation, were used to avoid parameter drift and 
ensure robustness. In this manner, speed control performance is optimized 
for individual vehicles and operating conditions providing improved perfor- 
mance and reduced cahbration effort compared to conventional fixed gain con- 
trollers. 


12.7 SHIP STEERING 


A conventional autopilot for ship steering is based on the PID algorithm. Such 
a controller has manual adjustment of the parameters of the PID controller 
and often also a dead zone called weather adjust—a simple version of a 
performance-related knob. Manual adjustments are necessary because the 
dynamics of a ship vary with speed, trim, and loading. It is also useful to 
change the autopilot settings when disturbances in terms of wind, waves, 
currents, and water depth are changed. Adjustment of an autopilot is a 
burden on the crew. A poor adjustment results in unnecessarily high fuel 
consumption. It is therefore of interest to have adaptive autopilots. A ship 
steering autopilot, Steermaster 2000 from Kockum Sonics AB in Sweden, and 
a roll damping equipment, Roll-Nix from SSPA Maritime Consulting AB in 
Sweden and Hyde Marine Systems in Ohio, are described in this section. 


Ship Steering Dynamics 


Simple ship steering dynamics were presented in connection with the discus- 
sion of gain scheduling in Section 9.5. That section detailed how the dynamics 
vary with the velocity of the ship and showed how the variations could be re- 
duced by gain scheduling. It has been shown by hydrodynamic theory that the 
average increase in drag due to yawing and rudder motions can be approxi- 
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mately described by 


ea = k (p? + 48”) (12.2) 


where R is the drag and ¥? and §? denote the mean square of heading error 
and rudder angle amplitude, respectively. The parameters k and A will depend 
on the ship and its operating conditions. The following numerical values are 
typical for a tanker: 

k=0.014deg ° A= 1/12 


It is thus natural to use the criterion 


T 
V= = f (VO -vrer +480) at (12.3) 


as a basis for the design and evaluation of autopilots for steady-state course 
keeping. The disturbances acting on the system are due to wind, waves, and 
currents. A detailed characterization of the disturbances and their effect on 
the ship’s motion is difficult. In a linearized model, disturbances appear as 
additive terms. It is common practice to describe them as random signals; the 
waves have a narrow band spectrum. The center frequency and the amplitude 
may vary significantly. 


Autopilot Design 


An autopilot has two main tasks: steady-state course keeping and turning. 
Minimization of drag induced by the steering is the important factor in course 
keeping, and steering precision is the important factor in turning. It is therefore 
natural to have a dual-mode operation. These two modes are described in the 
text that follows, together with the basic autopilot functions. 

The influence of variations in the speed of the ship is handled by g gain 
scheduling. The other disturbances are taken care of by feedback and adap- 
tation. Implementation of the gain scheduling is discussed in Section 9.5. It 
requires a measurement of the forward velocity of the ship. If disturbances 
are regarded as stochastic processes, steady-state course keeping can be de- 
scribed as a linear quadratic Gaussian problem. It is then natural to estimate 
an ARMAX model (Eq. 2.38). The particular process model used is 


Ay (t) —aAy(é —h) = bis (t — h) + ba(t — 2h) + ba (t — 3h) 
+ elt) + cie(t — h} + cge(t — 2h) (12.4) 


This model is built on Nomoto’s approximation (compare Section 9.5). The ad- 
ditional b term was introduced to allow additional dynamics to be captured 
as an increased time delay. The difference occurs because there is a pure in- 
tegration in the model from rate of turn to heading angle. A control law that 
minimizes the criterion of Eg. (12.3) is then computed by using the certainty 
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equivalence principle. This approach requires the solution of a Riccati equa- 
tion, which can be done analytically in the particular case. A straightforward 
minimum-variance control law was used in some early experiments. This was 
replaced by the LQG control law described previously, because there were sig- 
nificant advantages at short sampling intervals, which could not be used with 
the minimum-variance control law. The sampling interval in the model is set 
during commissioning. 


« 


Turning Controller 


The major concern in turning is to keep tight control of the motion of the ship, 
even at the expense of rudder motions, For high turning rates the dynamics 
of many ships are nonlinear. The normal course-keeping controller can handle 
small changes in heading, but it cannot handle large maneuvers because of the 
nonlinearities discussed previously. A special turning controller was therefore 
designed. The controller is a high-gain controller in which the feedback is of 
PID type. (Compare Fig. 1.3.) Appropriate PID parameters are determined 
during commissioning. The model used is nonlinear. It is designed so that the 
command signal is turning radius. The turning rate is thus r = u/R, where u 
is the speed of the ship and R is the turning radius. 


Human-Machine Interface 


The fact that turning radius is used as a command signal instead of turning 
rate simplifies maneuvering considerably, because it is easy to determine an 
appropriate turning radius from the chart. It also improves path following, 
since the speed of the ship may change during a turn. This is then compensated 
for automatically. The man-machine interface is very simple. There is one 
joystick to increase and decrease the heading. An optional joystick provides 
override control; whenever this is moved, it gives direct control of the rudder 
angle. Control can be transferred to the autopilot by a reset button. In making 
a turn, the desired turning radius is set by increase-decrease buttons. The turn 
is initiated when the joystick is moved to the new desired course. The turn is 
then executed, and the ship turns until the desired course is reached. The 
fixed-gain controller is used during the turn, and the adaptive course-keeping 
controller is initiated when the turn is complete. 

There are no adjustments on the course-keeping controller; everything is 
handled adaptively. Some default values are set during commissioning, but 
the fixed-gain controller can be activated when the operator pushes a switch 
labeled fixed control. This is typically used when there are heavy waves coming 
from behind (called a quartering sea). This condition makes steering difficult 
because the effective rudder forces are small and the disturbing wave forces 
are large, 
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Operational Experiences 


Early versions of the autopilot were field-tested in 1978, and the product 
was announced in 1979. The product is used in various kinds of ships. One 
installation, in a ferry that navigates between Stockholm and Helsinki, has 
been in continuous operation since 1980. It uses adaptive control all the time. 
The ability to cope with large variations in speed has been found to be very 
useful, and the turning radius feature is particularly useful for navigation in 
archipelagos, where a lot of maneuvering is necessary. Figure 1.24 indicates the 
improvements in course-keeping that can be obtained through adaptation. The 
decreased drag with the data shown in the figure corresponds to a reduction 
in fuel consumption of 2.7%. 


Rudder Roll Damping System 


On many ships it is desirable to reduce the roiling motion. Conventional 
roll damping systems on large naval ships use active fins or active as 
well as passive tanks. These systems are expensive to install, especially for 
retrofits. A third approach to roll damping is to use the rudder for roll damp- 
ing as well as for maneuvering. High-frequency movements of the rudder 
damp the rolling without influencing the mean value of the heading of the 
ship. Such a system can be inexpensive, since it can easily be connected 
to the ordinary steering system. One such system, Roll-Nix, has been de- 


Roll-Nix 


Adaptive Kalman filter 
Adaptive autopilot 
Turning regulator 
Adaptive roll damping 


Course set-point 


Figure 12.15 Block diagram of the Roll-Nix roll damping system. (With 
courtesy of SSPA Maritime Consulting AB.) 
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veloped by SSPA Maritime Consulting in Gothenburg, Sweden. The system 
is also marketed by Hyde Marine Systems in Cleveland, Ohio. A block di- 
agram of the system is shown in Fig. 12.15. Roll-Nix includes an adaptive 
Kalman filter, an adaptive course-keeping autopilot (optional), a high-gain 
turning controller (optional), and an adaptive roll damping controller. The first 
three parts are similar to those described for the Steermaster 2000 autopi- 
lot. 

The system uses a roll rate sensor together with course gyro and speed 
log to determine rudder commands that are superimposed on the ordinary 
autopilot, commands and fed into the steering engine. The operating principle 
is that the roll movements created by the rudder are opposite those of the 
roll movements caused by the waves. These counteractive moves damp the roll 
motions of the ship. In designing the roll damping system it is important to 
have quick rudder motions. Slow and large motions will influence the course 
keeping. The Roll-Nix system is provided with an autopilot as an option. The 
adaptive feature of the roll damping system is necessary to handle different 
weather conditions and ship speed. The Kalman filter is used to obtain an 
accurate roll motion signal from the measured rol) rate. 

Roll-Nix has been tested on several types of ships. For instance, the system 
has been tested on two Royal Swedish Navy ships: one attack craft and one 
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Figure 12.16 Results from sea trials with an attack craft at 27 knots and 
stern quartering seas (4 Beaufort): (a) without Roll-Nix; (b) with Roll-Nix. 
The significant roll angle was reduced by 58%, and the maximum roll angle 
was reduced by 53%. (With courtesy of SSPA Maritime Consulting AB.) 
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mine layer. The sea trials show that a significant roll reduction of 45—60% can 
be obtained for both the standard deviation and the maximum angle of the 
roll. The result from a sea trial with an attack craft is shown in Fig. 12.16. 
The roll reduction increases with increasing speed and rudder rates. Tests 
were also done on the mine layer HMS Carlskrona in September 1987. The 
following quote from the captain, Commander Hallin, gives an illustration of 
the performance of the system. l 


This particular occasion was when the ship was off the Dutch coast, 
bound for Helder, with seas coming in from astern on the port quarter. 
I was resting in my cabin. The time was 04.00 hrs. Suddenly I sensed 
that the ship had started to roll perceptibly, and I wondered what 
was going on. At once, I went up on deck and asked the officer of the 
watch what on earth was happening, and what the reason was for this 
sudden increase in the ship’s rolling motion. I was surprised to receive 
the reply, “We have just switched off the Roll-Nix. We need to have 
some data without Roll-Nix working, to see how much damping can be 
achieved.” I think that that is the most illustrative experience I have 
had of the Roll-Nix system to date. 


12.8 ULTRAFILTRATION 


Patients with little or no renal function need some form of artificial blood 
purification to stay alive. In dialysis the blood is cleansed of waste products 
and excess water, and the electrolytes in the blood are normalized. More than 
350,000 patients all over the world undergo this treatment a couple of times a 
week, In its most common form, hemodialysis, the blood flows past a semiper- 
meable membrane with a suitably composed dialysis fluid on the other side. 
Because of the large number of different dialyzers that are on the market, the 
control algorithm in the dialysis machine must be able to handle a wide span 
in process gain and other process characteristics. 

An adaptive pole placement controller has been used in the fluid control 
monitor (FCM) developed by Gambro AB in Lund, Sweden. The system has 
been in use for many years and it has performed very well. This is probably 
one of the most widely used adaptive controllers in the world today. In this 
section we describe the system. 


Process Description 


A schematic view of the Gambro AK-10 dialysis system is shown in Fig. 12.17. 
Only the parts that are relevant to flow and pressure control are shown in 
detail. Clean water is heated to around 87°C, and salt is added to physiological 
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Figure 12.17 Schematic diagram of a dialysis system. 


concentration. A pressure drop in the restrictor is created by the first pump 
to degas the solution. The restrictor and the first pump (P1) determine the 
flow into the dialyzer. Because of the compressibility of the air in the bubble 
chamber, flow changes to the dialyzer will be slowed down by a time constant. 

After passing a few measuring devices and a valve, the fluid leaves the 
dialysis fluid monitor (DFM) and passes the first flow-measuring channel of 
the FCM before entering the dialyzer. Before returning to the DFM, the second 
measuring channel of the FCM is passed. In the DFM a few more measuring 
devices and valves are passed before the second pump (P2). A restrictor is 
placed on the outlet to allow positive pressures in the dialyzer. 

To maintain a specified transmembrane pressure, the DFM has a control 
system that is based on a conventional fixed-gain digital PI controller. (See the 
block diagram in Fig. 12.18.) This controller has a sampling period of 0.16 s and 
an integration time of about 30 s. The purpose of the fluid control module is to 
control weight loss during the treatment. This 1s done by the external control 
loop shown in Fig. 12.18, which has the flow difference Qp as the measured 
variable and the setpoint to the pressure controller p, as the control variable. 


Process Dynamics 


The dialyzer dynamics can be approximately described by the model 
C= Qr-Bp 


where p is the transmembrane pressure and @; is the net fluid flow from the 
dialyzer. The constant C is the compliance. Parameter B, which represents the 
static gain, may, for example, vary from 1.6- 107} to 120. 107"? (mè s7} Pa™}), 
that is, a gain variation by a factor of 75, 
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Figure 12.18 Block diagram of the system for controlling transmembrane 
pressure p and the flow difference Q; control system. 


The complete dynamics of the pressure loop can be approximately described 
as a second-order transfer function. It has one pole associated with the dy- 
namics of the ultrafiltration and another associated with the pressure control 
system. The PI controller is tuned conservatively so that both poles are real. 
The dominating time constant is 30-50 s. The transfer function from the pres- 
sure setpoint to the flow @; then also has the same poles, but it also has a 
zero corresponding to the pole s = —B/C of the ultrafiltration (see Fig. 12.18). 
This zero can change significantly with the type of dialysis filter used. A con- 
sequence is that there is a drastic difference in the dynamics obtained for 
different filters. 

The main function of the system is to control the total water removal V 
during the treatment. The water removal is given by 


dV 
=e (12.5) 


An Earlier Control System 


An earlier system used a PI controller in the outer loop. Because of the large 
gain variations, it was necessary to use a conservative setting with low gain. 
This resulted in very sluggish control of the weight loss. Experiments with 
various simple forms of gain adjustment did not solve the problem and it was 
decided to test if an adaptive controller was feasible. 


Adaptive Control 


The adaptive controller was designed as an indirect adaptive pole placement 
algorithm. 
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Parameter Estimation. The dynamics can be expected to be of third order, 
representing the dynamics of the pressure loop and the dynamics of the filter 
introduced to filter the flow signal. This filter has a time constant of about 30 s. 
Experiments with system identification indicated, however, that data could be 
fitted adequately by 


Q p(t) = a@r(t — h) + bi pelt —h) + be p(t — 2h) (12.6) 


where Qy is the filtration flow and p, is the setpoint of the pressure loop. This 
model represents first-order dynamics with a time delay. A sampling interval of 
5 s was found to be suitable. The parameter estimation was made on differences 
to avoid problems with a constant level in the signals. 

The estimated steady-state gain is an important parameter. With a low 
estimated gain, the gain in the controller will be large. It is therefore advanta- 
geous to have the sum of the b parameters as one of the estimated parameters 
so that it is easy to set a lower limit to the estimated gain. This has been done 
in the FCM by using the regression vector 


(Q(t) pelt) pelt} — pelt —h)) 
instead of 
(Qr(t) pelt) pe(t—h)) 

If the estimated gain becomes too small, the estimate is stopped at the limit. 

A constant forgetting factor of 0.999 is used to track slowly time-varying 
parameters. To improve numerics, only the diagonal elements of the covariance 
matrix F are divided by this factor. It is well known that the equation for P(é) 
may be sensitive to numerical precision when a forgetting factor is used. This 
is because the eigenvalues of the P-matrix may be widely separated. Several 
methods to handie this problem were described in Chapter 11 and in the 
discussion of the ABB adaptive controller and Firstloop in this chapter. In this 
case the problem was avoided by careful scaling, and an ordinary recursive 
least-squares methed could be used. 


Control Design. A conventional pole placement algorithm and a design that 
guarantees integral action were used. (See Section 3.6.) Several factors influ- 
ence the choice of desired closed-loop poles. If a smooth control is desired in the 
steady state, the speed of setpoint changes should not be set too high. Second, 
the first step response at startup must not be quicker than the time required 
to get a reasonable model. A reasonable response time in accumulated flow is 
one hour. The other closed-loop poles, which correspond to flow changes, were 
specified by time constants of 25 s, 15 s, and 15 s. 

The controller can be reparameterized to correspond to a PID controller 
with a filtered derivative part. The structure was chosen so that the controller 
corresponds to a discrete-time PID controller in which the reference signal 
enters only the P and I parts. This corresponds to 8 = 1 in Eq. (8.3) in 
Chapter 8. A possible common factor in the estimated model was canceled 
before entering the design calculations. 
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Special Design Considerations 


Control of fluid removal during dialysis has a direct influence on the patient’s 
well-being. This imposes heavy demands on the control system. Several safety 
features have been included. Smooth performance from the first moment of con- 
trol is essential. This can be achieved by a careful choice of certain parameters, 
as we discuss next. 


Filtering. The measured flow signal is corrupted by measurement noise. Since 
a new value is available every second, it is possible to filter the signal. With 
a sampling period for control of 5 s, it was found to be suitable to use a first- 
order filter with a time constant of 30 s to filter flow and accumulated flow 
before using the values in the control algorithm. This smocths the control signal 
considerably without preventing fairly quick setpoint changes. 


Limits on Setpoint Changes. Both the absolute level and the rates of setpoint 
changes were limited on the basis of physical constraints. The PID controller 
was provided with conventional anti-windup protection to avoid problems with 
saturation. Parameter updating is also interrupted when the pressure setpoint 
is kept constant at a limit. At startup, when the model parameters may be far 
from their best values, it is also wise to prevent the control algorithm from 
changing the control signal (i.e., the pressure setpoint) too rapidly. The rate 
limit on the pressure setpoint prevents this; experience has shown that this 
limit is hit only rarely. 


Startup. A critical moment for an adaptive controller is the start, before the 
model parameters have been accurately estimated. It was required that its step 
response be almost perfect from the beginning. For this reason, most of the 
development time was spent in adjusting the parameters to ensure a smooth 
start. The following parameters were then found to be important: 

e Initial values of the parameter estimates, 

a Initial values of the covariance matrix P, 

« The desired closed-loop poles, 

» The time allowed for signals to settle before estimation and control starts, 

s Limits on the estimated parameters (especially the static gain), and 

» The limit on control changes (and control). 


The initial values of the parameter estimates are important, since they deter- 
mine the initial controller parameters. They were chosen to model a high-gain 
dialyzer, with an extra time delay, to give a cautious low-gain controller. This 
is perfect for a highly permeable (i.e., high-gain) membrane, but for normal 
membranes the pressure changes will be too small, a situation that is soon 
detected by the parameter estimator. 

It is important to choose the P matrix carefully. This determines the 
speed of parameter estimation. Values of P that are too large will make the 
estimates noisy, and there is a risk that the estimates may temporarily give 
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bad controllers. Also, a value of P that is too large can quickly eliminate the 
carefully chosen initial parameters in the estimator. With values of P that are 
too small the time needed to find a good model can be very long, a situation 
that is not at all acceptable. 

It was found to be advantageous to introduce a lower bound on the esti- 
mated gain in the model. With low-gain dialyzers there would otherwise be a 
tendency for the estimator to decrease the gain estimate too much, and the 
controller gain would be too high for a while. A suitable limit for the model 
gain could be determined from the known data of existing dialyzers. To facil- 
itate the checking of the estimated gain, a special form of the process model 
was used, The estimated pole was also bounded away from a pure integrator, 
since this pole enters the expression for the gain limit. 

The limit on the setpoint changes also helps to ensure a smooth startup. 
The desired closed-loop poles are important design parameters. The equivalent 
time constants should be chosen to be long enough to give the estimator time 
to find a good model before the setpoint is approached for the first time. They 
should also be as short as possible to give a rapid response to setpoint changes. 
The equivalent time constants of the closed-loop systems were chosen to be 720, 
five, and three sample intervals, which correspond to 1 hour, 25 s, and 15 s, 
respectively. Without the requirement of a smooth startup it would have been 
possible to speed up the desired closed-loop dynamics considerably. However, 
setpoint changes are not very frequent, and smooth startup is much more 
important than rapid setpoint changes. 

If by chance the desired pressure were already set at startup, there would 
be no pressure change that would help to improve the estimates of model 
parameters. Therefore there is a period of forced small pressure changes for 
the first eight minutes after a reset. This is accomplished by periodic changes 
of the setpoint every 45 s. 

With an adaptive controller it is very important to ensure that the es- 
timated model is never destroyed. Therefore the estimator should always be 
given true values for control and measured signals. If for some reason, such 
as an alarm situation causing the DFM to bypass the dialysis fluid, the con- 
trol signal is not allowed to do its job, the estimator must be turned off. The 
controller will then use the old estimates for a while. 

After all such breaks and at startup, a settling period is allowed, during 
which correct signals are entered into all the vectors but no estimation is done. 
This settling period is very important, especially at startup, when the estimates 
are most sensitive to changes in the signals, Errors in the signals also force the 
P-matrix to decrease rapidly, so future learning is slowed down considerably. 


Alarms. Appropriate alarms are an important part of any useful control sys- 
tem. An alarm indicates if the volume control error is too large and also if 
something is wrong in the dialysis fluid monitor or with the pipes. If there 
is a stop in the blood pipe from the dialyzer to the drip chamber, the blood 
pressure within the dialyzer will rise, causing a large ultrafiltration rate and 
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minimized pressure. The alarm in the FCM will then cause the DFM to enter 
a patient-safe condition. 


Operational Experience 


It has been possible to use the algorithm to handle ultrafiltration control 
for all kinds of dialyzers that are available today. Treatment modes such as 
single-needle or double-needle treatment or sequential dialysis with periods of 
isolated ultrafiltration have been tested. Dialyzers with variations in values of 
B by a factor of 75 have been tested in the laboratory without any problems. 
After a period of approximately five months of clinical trials at several clinics, 
full-scale production started in the autumn of 1986. Over 11,000 units had been 
delivered as of December 1993. Since every machine may be used in several 
hundred treatments each year, there is now extensive practical experience with 
this algorithm, which seems to work well under all kinds of conditions. 

Figure 12.19 shows responses in differential flow Q; to step changes in the 
setpoint Q re when the system is under adaptive control. The pressure p is also 
shown. Despite the noisy flow measurement, the response of the closed-loop 
system is very good. 
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Figure 12.19 Adaptive control of a dialysis system. Responses in differential 
flow Q; (solid line) and transmembrane pressure p to step changes in the 
setpoint @;, (dashed line) for a plate membrane. The adaptive control starts 
at t = 6. (With courtesy of Gambro AB.) 
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12.9 CONCLUSIONS 


In this chapter we have tried to give an idea of how adaptive techniques are 
used in real control systems. A few general observations can be made. 

Although there are many applications of adaptive control, it is clear that 
adaptive control is not a mature technology. The techniques were introduced 
in products in the early 1980s. Those in use today are mostly first-generation 
products; there are second-generation products in only a few cases. 

The description of the products and the real applications show clearly that 
although the key principles are straightforward, many “fixes” must be done to 
make the system work well under all possible operating conditions. The need 
for safety nets, safety jackets, or supervision logic is not specific to adaptive 
control. Similar precautions must be taken in all real control systems, but 
since adaptive control systems are complex to start with, the safety nets that 
are required can be quite elaborate. 

The examples clearly show that adaptive systems are not black box solu- 
tions that are a panacea, Rather, adaptive methods are useful in combination 
with other control design methods. Both in the rolling mill example and in the 
ship steering autopilot, adaptation was combined with gain scheduling. An- 
other example is the use of a feedforward signal in the pulp dryer to improve 
the adaptation transient. 

A third observation is that the human-machine interface is very important. 
A fourth observation is that some operating conditions are not conveniently 
handled by adaptive control. One example is the behavior of ship steering 
autopilots in a quartering sea. 

There are unquestionably many different adaptive techniques, but so far, 
only a few of them have been used in industrial products. In many cases the 
choices have not been made by comparing several alternatives; one method 
has been chosen quite arbitrarily. This means that many alternatives have not 
been tried. 

The computing power that is available has a significant influence on the 
type of control algorithms that can conveniently be implemented. The sim- 
ple auto-tuners use simple 8-bit microprocessors, whereas some of the more 
advanced systems use full 32-bit architecture. In most process control applica- 
tions there are no problems with computing time. The rolling mil applications, 
on the other hand, are quite demanding. The computing power that is avail- 
able also has a significant impact on what human-machine interface can be 
implemented. 

The applications also indicate the importance of the safety network. It 
is of interest to see the facilities provided in the toolbox and the specific 
solutions used in the dedicated systems. It is clearly much simpler to design 
a safety network for a dedicated system, in which good parameter bounds can 
be established. 

The applications described in this chapter and elsewhere indicate that 
there are three cases in which it is very useful to use adaptive control: 
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» When the system has long time delays, 
» When feedforward can be used, and 
» When the character of the disturbances is changing. 


In all these cases it is necessary to have a model of the process or the dis- 
turbances to effectively control the system. It is then beneficial to be able to 
estimate a model and to adapt to changes in the process. 
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CHAPTER 13 


PERSPECTIVES ON 
ADAPTIVE CONTROL 


13.1 INTRODUCTION 


In this final chapter we attempt to give some perspective on the field of adaptive 
control. This is important but difficult because the field is in rapid development. 
The starting point is a short discussion of some closely related areas that are 
not covered in the book. These include adaptive signal processing in Section 
13.2 and extremum control in Section 13.3. Particular attention is given to the 
field of adaptive signal processing, in which a cross-fertilization with adaptive 
control appears particularly natural. 

Adaptive regulators and auto-tuning have complementary properties. Auto- 
tuners require very little prior information and give a robust ballpark estimate 
of gross system properties. Adaptive regulators require more prior knowledge, 
but they can give systems with much improved performance. It thus seems 
natural to combine auto-tuning with adaptive control in systems that combine 
several algorithms. Apart from algorithms for control, estimation, and design, 
it may also be useful to include supervision. It seems logical to use an expert 
system to monitor and control the operation of such a system. Systems of 
this type have been called expert control systems and are briefly discussed in 
Section 13.4. The use of expert systems also provides a natural way to separate 
algorithms from logic that occurs in all control systems. 

Adaptation is related to learning; in Section 13.5 we discuss some early 
uses of learning in control systems and how it is related to adaptive control 
as we now understand it. In Section 13.6 we attempt to speculate on future 
directions in the theory and practice of adaptive control. 
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13.2 ADAPTIVE SIGNAL PROCESSING 


Automatic control and signal processing have strong similarities; similar math- 
ematical models and techniques are used in the two fields. However, there are 
also some significant differences. The time scales can be different. Signal pro- 
cessing often deals with rapidly varying signals, as in acoustics, in which sam- 
pling rates of tens of kilohertz are needed, In control applications it is often 
(but not always) possible to work with much slower sampling rates. 

A more significant difference is that time delays play a minor role in sig- 
nal processing. It is often permissible to delay a signal without any noticeable 
difficulty. Because control systems deal with feedback, even small time delays 
can result in drastic deterioration in performance. A third difference is in the 
industrial markets for the technologies. In signal processing, there are some 
standard problems that have a mass market, as in the field of telecommunica- 
tions. The control market is more diversified and fragmented. Adaptive control 
is used to design control systems that work well in an unknown or changing 
environment. The environment 1s represented by process dynamics and dis- 
turbance signals. Adaptive signal processing is used to process signals whose 
characteristics are unknown or changing. More emphasis is given in signal 
processing to fast algorithms. Although there have been attempts to bring the 
fields closer together, much more effort is needed in this direction. To illustrate 
this, we will describe a few typical adaptive signal processing problems. 


Prediction, Filtering, and Smoothing 


Prediction, filtering, and smoothing are typical signal processing problems, 
which can all be described as follows: Given two signals x and y and a filter F, 
determine the filter such that the signals y and 7 = Fx are as close as possible. 
The problem can be illustrated by the block diagram in Fig. 13.1. In a typical 
case we have 
x(t) = s(t) + v(t) and =e y(t) = s(t + 7) 

where s is the signal of interest and v is some undesirable disturbance. The 
problem is called smoothing if t < 0, filtering if t = 0, and prediction if t > 0. 
Solutions to such problems are well known for signals with known spectra and 
quadratic criteria. The corresponding adaptive problems are obtained when the 
signal properties are not known. All recursive parameter estimation methods 


Figure 138.1 Illustration of filtering, prediction, and smoothing. 
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Figure 13.2 (a) An adaptive system for filtering, prediction, or smoothing 
and (b) its simplified representation. 


can be applied to the adaptive signal processing problems. This is illustrated in 
Fig. 13.2, which gives a typical adaptive solution. The adjustment mechanism 
can be any recursive parameter estimator. The details depend on the struc- 
ture of the filter and the particular estimation method chosen. An example 
illustrates the idea. 


EXAMPLE 13.1 Output error parameter estimation 
Assume that the filter is represented as an ordinary pulse transfer function 


boz?! + biz? +--+ bni 
Fu T, Ta EE A: 


F(z) = 
To obtain a recursive estimator, the parameter vector 

C= (a, rae Gy bo wade bni) 
and the regression vector 

g(t- 1)=(—ĵ(t— 1i) ... —f(t—n) x(t—1) ... x(t—n)) 
are introduced. The error is then given by 
E(t) = y(t) — H(t) = x) ~ p7 (t — 1)Ê(t — 1) 

and the equation for updating the estimate is 


6(t) = 6(t — 1) + P(Dp(t — 1)e(t) o 


The special case of Example 13.1, obtained when the filter is an FIR filter 
and a gradient parameter estimation scheme is used, is particularly simple. 
This is the LMS algorithm. 
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Microphone for 
ambient noise 


Figure 13.3 Use of an adaptive filter for adaptive noise cancellation. 


Block Diagram Representation 


The block diagram in Fig. 13.2 represents a solution to a generic signal pro- 
cessing problem. To make it easy to build large systems, it is convenient to 
consider this module as a building block that can be used for many different 
purposes, This is simpler if a proper representation is used. For that purpose 
it is convenient to represent the module as a block that receives signals x and 
y and delivers estimates # and Ê. Such a representation, shown in Fig. 13.2(b), 
makes it possible to describe several adaptive signal processing problems. 


Adaptive Noise Cancellation 


Consider the situation of a mobile telephone in a car where there is a consider- 
able ambient noise. Assume that two microphones are used. One is directional 
and picks up the driver’s voice corrupted by noise; the other is directed away 
from the driver and picks up mostly the ambient noise. By connecting the mi- 
crophones to an adaptive filter as shown in Fig. 13.3, it is possible to obtain 
a signal that is considerably improved. Removal of power frequency hum from 
measurement signals is another application at adaptive noise cancellation. 


Adaptive Differential Pulse Code Modulation (ADPCM) 


Digital signal transmission is becoming important because of the rapid de- 
velopment of new hardware. Its use in ordinary telephone communication is 
increasing. Pulse code modulation (PCM) is the standard method for convert- 
ing analog signals to digital form. The analog signal is filtered and digitized by 
using an analog-to-digital (A-D)} converter. The digitized signal is then trans- 
mitted in serial form. If the A-D converter has B bits and the sampling is 
f Hz, the transmission rate required is fB bits/s. For standard voice signals, 
a sampling rate of 8 kHz is typically used. A resolution of 12 bits in the A-D 
converter is required to get good-quality transmission. The bit rate required is 
thus 96 kbit/s. By having an A-D converter with a nonlinear characteristic it 
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Figure 13.4 Block diagram of a differential pulse code modulation (DPCM) 
system. 


is possible to reduce the bit rate to 64 kbit/s, which is the standard for digital 
voice transmission, 

It is highly desirable to reduce the transmission rate, because more com- 
munication channels are then obtained with the same transmission equipment. 
The bit rate can be reduced significantly by using differential pulse code mod- 
ulation (DPCM). In this technique the innovations of the signal are computed 
as £ = y—¥, where Ẹĵ is generated by filtering the innovations through a pre- 
dictive filter. Only the innovations are transmitted (see Fig. 13.4). The receiver 
has a prediction filter with the same characteristics as the filter in the sender. 
The signal # can then be reconstructed in the receiver. The bit rate that is 
required is reduced significantly because fewer bits are required to represent 
the residual. It has been shown that for voice signals, a resolution of 4 bits is 
sufficient. This means that the bit rate required for the transmission can be 
reduced from 64 kbit to 32 kbit. 

The prediction filter depends on the character of the transmitted signal. 
Substantial research into the characterization of speech has shown that it 
can be well predicted by linear filters. However, the properties of the filter 
will change with the particular sound that is spoken. To predict speech well, 
it is thus necessary to make the filters adaptive. The transmission scheme 
obtained is then called adaptive differential pulse code modulation (ADPCM). 
Such a scheme, which uses an adaptive filter based on the output error method, 
is shown in Fig. 13.5. Notice that the adaptive filters at the transmitter and 
the receiver are driven by the residual only. If the filters in the receiver and 
the transmitter are identical, the filter parameters will automatically be the 
same. The adaptive filters have therefore been standardized by CCITT (Comité 
Consultatif Internationale de Télégraphique et Téléphonique). The filter that is 
used has the transfer function 


H(z) = Soe thet ++ + bs 
A(z? 4+ az + a2) 


The regression vector associated with the output error estimation is 


p(t) =(—x(t) —x(f—1) e) ... e(t—5)) 
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Figure 18.5 Block diagram of an adaptive differential pulse code modula- 
tion system. 


and the associated parameter vector is 
@= (ai ag bo wee bs } 


The standard least-squares estimator is of the form 
6(t + 1) = O(t) + P(t + Yo(te(t +1) 


Several drastic modifications are made to simplify the calculations. A constant 
value of the gain is used. The multiplication is avoided by just using the signs 
of the signals. Leakage is also added to make sure that the estimator is stable. 
The updating of the parameters b; is then given by the sign-sign algorithm 


b;(t) = (1—28) 82) + 2-7 sign (e(t — i)) sign (e(t)) (13.1) 


Similar approximations are made in the other equations. The computations 
in Eq, (13.1) are very simple. They can be done by shifts and the addition 
of a few bits, which can be accomplished with a small VLSI circuit. The 
CCITT ADCPM standard was achieved after significant experimentation. It 
is a good example of how drastic simplifications can be made with good 
engineering. 


13.3 EXTREMUM CONTROL 


The control strategies that have been discussed in the book have mainly been 
such that the reference value is assumed to be given. The reference value is of- 
ten easily determined. It can be the desired altitude of an airplane, the desired 
concentration of a product, or the thickness at the output of a rolling mill. On 
other occasions it can be more difficult to find the suitable reference value or 
the best operating point of a process. For instance, the fuel consumption of a 
car depends, among other things, on the ignition angle. The mileage of the car 
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Performance 


Figure 13.6 A simplified block diagram of an extremum control system. 


can be improved by a proper adjustment, but the efficiency will depend on such 
conditions as the condition of the road and the load of the car. To maintain the 
optimal efficiency, it is necessary to change the ignition angle. 

Tracking a varying maximum or minimum is called extremum control. The 
static response curve relating the inputs and the outputs in an extremum 
control system is nonlinear. The task of the controller is to find the optimum 
operating point and to track it if it is varying. Several processes have this kind 
of behavior. Control of the air-fuel ratio of combustion is one example. The 
optimum will change, for instance, with temperature and fuel quality. Another 
example is water turbines of the Kaplan type, in which the blade angle of 
the turbine is changed to give maximum output power. The same problem is 
encountered in wind power plants, in which pitch angle is changed depending 
on wind speed. 

Extremum control is related to optimization techniques; many of the ideas 
have been transferred from numerical optimization. There was great interest 
in extremum control in the 1950s and 1960s, and some commercial products 
were put on the market. For instance, the first computer control systems in- 
stalled in the process industry were motivated by the possibility of optimizing 
the setpoints of the controllers. The interest then declined, partly because of 
the difficulty of implementing the optimizing controllers. Furthermore, there 
is great difficulty in finding appropriate process models. Developments in com- 
puters have led to a renewed interest in extremum control and its combination 
with adaptive control. Improved efficiency of the process can result in large 
savings in the energy and raw material costs. 

Figure 13.6 shows a simplified block diagram of an extremum control 
system. The process can work in open loop or in closed loop, as m the figure. 
The most important feature is that the process is assumed to be nonlinear in 
the sense that at least the performance is a nonlinear function of the reference 
signal, The goal of the search algorithm is to keep the output as close as 
possible to the extremum despite changes in the process or the influence of 
disturbances. The output used in the search algorithm is some measurement 
of the performance of the system—for instance, efficiency. The conventional 
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regulator can also use this signal, but it is more common for the regulator to 
use some other output of the process. 


Models 


Extremum control systems are, by necessity, nonlinear, How the processes are 
modeled is therefore all-important. Many investigations of extremum control 
systems assume that the systems are static. This assumption can be justified 
if the time between the changes in the reference value is sufficiently long. 
For static systems it is possible to use many of the methods from numerical 
optimization. A typical description of the process is 


y(t) = f (u(t), 6,2) (13.2) 


where f is a nonlinear function and @ is a vector of unknown parameters that 
may change with time. 

If there are dynamics in the process, the performance may not have settled 
at a new steady-state value before the next measurement is taken. This will 
give an interaction in the control system that can be difficult to handle. The 
dynamic influence will increase the complexity considerably. 

In many applications it is not easy to find the appropriate models and to 
determine the exact nature of the nonlinearities involved. It can therefore be 
appropriate to combine adaptivity and extremum control. One way to simplify 
the identification of an unknown nonlinear model is to assume that the process 
can he divided into one nonlinear static part and one linear dynamic part. 
Models with different properties are obtained if the nonlinearity precedes or 
follows the linear part. The complexity of the problem will also depend on 
which of the variables in the process can be measured. One special type of 
model that has been used in extremum control systems is the Hammerstein 
model. A typical discrete-time model of this type is 


A(q)y(t) = B(@) f (u(t) (13.3) 


where f is a nonlinear function, typically a polynomial. 

The main effect of an input nonlinearity is that it restricts the possible 
input values for the linear part. The nonlinear control problem can then be 
treated as a linear control problem with input constraints. The case with the 
output nonlinearity perhaps leads to more realistic problems but is also more 
difficult to solve. 


Extremum Control of Static Systems 


The first extremum control systems were based on analog implementation. 
One way to perform the optimization is the so-calied perturbation method. The 
basic idea is to add a known time-varying signal to the input of the nonlinearity, 
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then observe the effect on the output and make a correlation between these 
two signals. Depending on the phase between the two signals, the direction 
toward the extremum can be determined. The perturbation method has been 
used for extremum control of chemical reactors, combustion engines, and gas 
furnaces, for instance. 

Extremum control of static systems as in Eq. (13.2) is in essence a prob- 
lem of numerical optimization. With the analog implementations the possible 
methods were severely restricted. When a digital computer is available, stan- 
dard algorithms for function minimization can be used. Usually, it is possible 
only to measure the function values, not its derivative. The function minimiza- 
tion then has to be done by using numerically computed derivatives. Some 
methods use only function comparisons. These methods can be used even for 
minimization of nonsmocth functions. 

Performance measurements are typically corrupted by noise. It is then 
necessary to average out the influence of the noise. This implies that the gain 
in the optimization algorithm should go to zero. However, if the extremum is 
changing with time, the gain should not go to zero. This is the same compromise 
as is discussed in connection with tuning and adaptive control. 

Most schemes for extremum control of static systems do not build up any in- 
formation about the nonlinearity. The “states” of the algorithms are essentially 
the current estimate of the optimum point and some previous measurements. 
By using a model and system identification it is possible to utilize the mea- 
surements of the system better and to follow time variations in the process. 


Extremum Control of Dynamic Systems 


If there are dynamics in the process, it is necessary to take this into considera- 
tion in doing the optimization. The correlation and interaction between differ- 
ent measurements of the performance will otherwise confuse the optimization 
routine, One possibility, discussed previously, is to wait until the transients 
have vanished before the next change is made. Of course, this will increase the 
convergence time, especially if the process has long time constants. One way 
around the problem is to base the optimization on nonlinear dynamic models. 
An example is the Hammerstein model. A model of this type with an input 
nonlinearity of second order is 


A(q)¥(t) = bo + Bi(g)u(é) + Be(q)u*(é) + C(q)e(t) (13.4) 


The main reason for the popularity of the Hammerstein model is not that it is 
a good picture of the reality, but rather that it is linear in the parameters. The 
parameters can be estimated, for example, by using recursive least squares. 
The static response between the input and the output is given by 


A(1)yo = bo + Bi (1)uy + Bo(1)ué 
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The methods for static optimization discussed previously can now be used. Also 
note that the gradients and the Hessian are easily computed, a feature that 
will speed up the convergence. 


Conclusions 


The field of extremum control is far from mature. One crucial point is the 
modeling of the processes and the nonlinearities. It is generally very difficult 
to analyze nonlinear control problems and to derive optimal controllers, espe- 
ctally if there are stochastic disturbances acting on the system. The extremum 
control problem also has connections with the dual control problem discussed 
in Chapter 7. Extremum-seeking methods combined with adaptive control are 
of great practical interest, since even small improvements in the performance 
can lead to large savings in raw material and energy consumption. There are 
commercial extremum controllers. 


13.4 EXPERT CONTROL SYSTEMS 


All practical control systems contain heuristics. This appears as logic around 
the basic control algorithm. Adaptive systems have a lot of heuristics in the 
safety logic. Expert systems offer an interesting possibility of structuring the 
logic in a control system. If a good way to handle heuristic logic is available, 
it is also possible to introduce more complex control systems that contain sev- 
eral different algorithms. For example, it is possible to combine auto-tuners 
and adaptive algorithms that have complementary properties. The auto-tuner 
requires little prior information; it is very robust and can generate good param- 
eters for a simple control law. Adaptive regulators can be more complex, with 
potentially better performance. Since they are based on local gradient proce- 
dures, they can adjust the regulator parameters to give a closed-loop system 
with very good performance, provided that reasonably good a priori guesses of 
system order, sampling period, and parameters are given. The algorithms will 
not work if the prior guesses are too far off. With poor prior data they may 
even give unstable closed-loop systems. This has led to the development of the 
safety logic discussed in Chapters 11 and 12. 


Expert Systems 


One objective of expert systems is to develop computer-based models for prob- 
lem solving that are different from physical modeling and parameter estima- 
tion. An expert system attempts to model the knowledge and procedures used 
by a human expert in solving problems within a well-defined domain. Knowl- 
edge representation is a key issue in expert systems. Many different approaches 
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have been attempted, such as first-order predicate calculus (logic), procedural 
representations, semantic networks, production systems or rules, and frames. 
A knowledge-based expert system consists of a knowledge base, an inference 
engine, and a user interface. 


The Knowledge Base. The knowledge base consists of data and rules. The 
data can be separated into facts and goals. Examples of facts are statements 
such as “The system appears to be stable,” “PI control is adequate,” and 
“Deviations are normal.” Typical examples of goals are “Minimize the vari- 
ations of the output,” “Find out whether gain scheduling is necessary,” and 
“Find a scheduling table.” Data is introduced into the database by the user 
or via the real-time knowledge acquisition system. New facts can also be cre- 
ated by the rules. The rule base contains production rules of the type: “If 
premise then conclusion do action.” The premise represents facts or goals from 
the database. The conclusion can result in the addition of a new fact to the 
database or modification of an existing fact. The action can be to activate 
an algorithm for diagnosis, control, or estimation. These actions are differ- 
ent from those found in conventional expert systems. The rule base is often 
structured in groups or knowledge sources that contain rules about the same 
subject. This simplifies the search. In the control application the rules repre- 
sent knowledge about the control and estimation problem that are built into 
the system. This includes the appropriate characterization of the algorithms, 
judgmental knowledge about when to apply them, and supervision and di- 
apnosis of the system. The rules are introduced by the knowledge engineer 
via the knowledge acquisition system, which assists in writing and testing 
rules. 


Inference Engine. The inference engine processes the rules to arrive at con- 
clusions or to satisfy goals. It scans the rules according to a strategy, which 
decides from the context (current database of facts and goals) which produc- 
tion rules to select next. This can be done according to different strategies. 
In forward chaining the strategy is to find all conclusions from a given set 
of premises. This is typical for a data-driven operation. In backward chaining 
the rules are traced backward from a given goal to see whether the goal can 
be supported by the current premises. This is typical for a diagnosis problem. 
The search can be organized in many different ways, depth-first or breadth- 
first. There are also strategies that use the complexity of the rules to decide 
the order in which they are searched. To devise efficient search procedures, 
it is convenient to decompose the rule base into pieces that deal with related 
chunks of knowledge. If the rules are organized in that way, it is also possible 
for a system to focus its attention on a collection of rules in certain situations. 
This can make the search more efficient. 


User Interface. The user interface can be divided into two parts. The first 
part is the development support that the system gives. This contains tools 
such as rule editor and rule browser for development of the system knowledge 
base. The other part is the run-time user interface. This contains explanation 
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facilities that make it possible to question how a certain fact was concluded, 
why a certain estimation algorithm is executing, and so on. It is also possible 
to trace the execution of the rules. The user interface can also contain facilities 
to deal with natural language. 


Expert Control 


The idea of expert control is to have a collection of algorithms for control, 
supervision, and adaptation that are orchestrated by an expert system. A block 
diagram of such a system is shown in Fig. 13.7. A comparison with Fig. 1.19 
shows that the system is a natural extension of a self-tuning regulator. Instead 
of having one contro] algorithm and one estimation algorithm, the system has 
several algorithms. It also has algorithms for excitation and for diagnosis, as 
well as tables for storing data. Apart from this, the system also has an expert 
system, which decides when a particular algorithm should be used. The expert 
system contains knowledge about particular algorithms and the conditions 
under which they can be used. 

In the special case in which there is only one algorithm of each category, 
Fig. 13.7 can be viewed as a well-structured way of implementing safety logic 
for an ordinary adaptive regulator. In that case the approach has the advan- 
tage that it separates the safety logic from the control algorithms. Another 
advantage is that the knowledge is explicit and can be investigated via the 
user interface. 


Knowledge- 
based system 


Control ji 


Figure 13.7 A knowledge-based expert control system. 
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13.5 LEARNING SYSTEMS 


The notion of learning systems has been developed in the fields of artificial 
intelligence, cybernetics, and biology. In its most ambitious form, learning 
systems attempt to describe or mimic human learning ability. Attainment 
of this goal is still far away. The learning systems that have actually been 
implemented are simple systems that have strong relations to adaptive control. 
The systems have many names: neural nets, connectionist models, parallel 
distributed processing models, and so on. 


Michie’s Boxes 


This system grew out of early work on artificial intelligence (see Michie and 
Chambers, 1968) and attempts to balance an inverted pendulum (see Fig. 13.8). 
The system has four state variables, ø, @, x, and x, which are quantized in a 
crude way, with five levels for the position variables x and g and three levels 
for the velocity variables x and ¢. The state space can thus be described by 
225 discrete states. The control variable is quantized into two levels: force left 
(L) or force right (R). The control law can be represented by a binary table 
with 225 entries. In the experiment the table was initialized with randomly 
chosen [’s and R’s in the table. A simple scoring method was used to update 
the table entries as a result of experimental runs. Scoring was based on how 
long the pendulum stayed upright and the number of times the pendulum was 
in a discrete state. The system was able to balance the pendulum for about 
25 minutes after a 60-hour training period. The table that defines the control 
action can be expressed in logic as: 


if cart is far left and cart is hardly moving and pendulum is hardly 
leaning and pendulum is swinging to right then apply force right. 


For this reason the control law is also called linguistic control. When the logic 
is replaced by fuzzy logic, it is also called fuzzy control. 

The training algorithm that is used in Michie’s Boxes is similar to that 
used in programs for playing checkers and chess, but the pendulum problem is 


Figure 13.8 An inverted pendulum. 
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simpler than game playing. Training can be shortened by using a teacher, that 
is, by applying a scoring algorithm to an experiment in which the pendulum is 
balanced by an expert. A learning system of this type is obviously closely related 
to a model-reference adaptive system. The reference model can be viewed as a 
teacher. 


The Perceptron 


In a system such as Michie’s Boxes the control law is a logic function that gives 
the control action as a function of sensor patterns. The function is adaptive 
in the sense that it will adjust itself automatically. The perceptron proposed 
by Rosenblatt (1962) is one way to obtain a learning function. To describe 
the perceptron, let uj, i = 1,2,...,n, be inputs, and let y;, i = 1,2,...n, be 
outputs. In the perceptron the output is formed as 


yilt) = e( y wi; (t) u;(t) — ) t=1,2,...,m (13.5) 


j=l 


where wij are weights, b is a bias, and f is a threshold function, for example, 


1 ifx>0 
f= {o reo 
To update the weights, the perceptron uses a very simple idea, which is called 
Hebb’s principle: Apply a given pattern to the inputs and clamp the outputs 
to the desired response, then increase the weights between nodes that are 
simultaneously excited. 
This principle was formulated in Hebb (1949), in an attempt to model 
neuron networks. Mathematically it can be expressed as follows: 


wijt + 1) = w(t) + vuitt) (94) — y(i) (13.6) 
where y is the desired response and y; is the response predicted by the model 


Eq. (13.5). By regarding the weights as parameters, it becomes clear that the 
updating formula of Eq. (13.6) is identical to a gradient method for parameter 
estimation. 

Widrow and Hoff (1960) developed special-purpose hardware, called the 
Adaline, to implement perceptronlike devices. The learning algorithm used by 
Widrow was based on a simple gradient algorithm like Eq. (13.6). In devices 
like the perceptron and the Adaline, learning is interpreted as adjusting the 
coefficients in a network. From this point of view it can equally well be claimed 
that an adaptive system like the MRAS or the STR is learning. The mechanisms 
for determining the parameters are also similar. 

A drawback of the perceptron is that it can recognize only patterns that 
can be separated linearly. It fell into disfavor because of exaggerated claims 
that could not be justified. It was heavily criticized in a book by Minsky and 
Papert (1969}. The idea of designing learning networks did, however, persist. 
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The Boltzmann Machine 


The Boltzmann Machine may be viewed as a generalization of a perceptron. It 
was designed to be a highly simplified model of a neural network. The machine 
consists of a collection of elements whose outputs are zero or one. The elements 
are linked by connections having different weights. The output of an element 
is determined by the outputs of the connecting elements and the weights of the 
interconnections. The firing is randomized in such a way that the probability 
of firing increases with the weighted sum of the inputs to an element. Some 
elements are connected to inputs and others to outputs, and there are also 
internal nodes. The connections in a Boltzmann Machine are assumed to be 
symmetric, which is a significant restriction. 

In the perceptron there is a direct coupling between the inputs and the out- 
put. The Boltzmann Machine is much more complicated, because it can also 
have interna! nodes. This implies that Hebb’s principle cannot be applied di- 
rectly. An extension called back-propagation has been suggested in Rumelhart 
and McClelland (1986). 

There are many variations of neural networks. Dynamics can be introduced 
in the nodes. Hopfield observed that the weights could be chosen so that the 
network would solve specific optimization problems (Hopfield and Tank, 1986). 


Hardware 


An interesting feature of the neural networks is that they operate in parallel 
and that they can be implemented in silicon. Using such circuits may be a new 
way to implement adaptive control systems. A particularly interesting feature 
is that it is easy to integrate the networks with sensors. 


13.6 FUTURE TRENDS 


In this section we speculate on open research issues and the future of adaptive 
control. One interesting aspect of adaptive control is that it may be viewed 
as an automation of the modeling and design of control systems. To apply a 
technique automatically, it is necessary to have a very clear understanding 
of the conditions under which it can be applied. Ideally, this understanding 
should be formalized. Research on adaptive control will thus sharpen the 
understanding of control and parameter estimation. 


Industrial Impact 


There are adaptive systems that have been in continuous operation since the 
mid-1970s. Several products were announced in the early 1980s, and the de- 
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velopment has accelerated since that time. Adaptation is used both in general- 
purpose controllers and in dedicated systems. Practically all PID controllers 
that are introduced today have some facility for tuning and adaptation. This 
applies even to simple temperature controllers. It has taken longer for adaptive 
techniques to appear in distributed systems for process control, but many dis- 
tributed control systems are now provided with tuning and adaptation. There 
is a rich variety of special-purpose systems that use adaptation. For example, 
it has been shown that adaptation can provide improved riding quality in cars. 

In summary, many adaptive algorithms are well understood. Our insight 
into how adaptive methods can be used to engineer better control systems 
is growing. Insight, understanding, and appropriate computing hardware are 
available. It seems likely that a large proportion of the control systems made 
in the future will have automatic tuning or adaptation. When adaptive control 
becomes more widely used, interesting phenomena that demand theoretical 
understanding will undoubtedly also be observed. For instance, what happens 
when many adaptive controllers are connected to one process? Will they in- 
teract? How should the system be initialized? We can thus look forward to 
interesting developments. 


Algorithm Development 


There are several important issues that relate to algorithm development. Cur- 
rent toolboxes for adaptive control use only a few of the algorithms that have 
been developed. It seems safe to guess that the toolboxes will be expanded, and 
it would also seem useful to include auto-tuners in the toolboxes to simplify ini- 
tialization. Significant improvements can thus be achieved with tools that are 
already known, but there is also a need for improved techniques. Better meth- 
ods for control system design are needed. Techniques that can explicitly handle 
actuator constraints and model uncertainties would be valuable contributions. 
It would be very useful to have methods for estimating the unstructured un- 
certainties. 

Diagnostic routines that will tell whether a control algorithm is behaving 
as expected are needed. Such algorithms are well known for minimum-variance 
contro], in which monitoring can be done simply by calculating covariances. It 
is straightforward to develop similar techniques for other design methods. 

There is both theoretical and experimental evidence that probing signals 
are useful. It is also clear that it is not practical to introduce probing via 
stochastic control theory because of the excessive computational requirements. 
A significant challenge is therefore to find other ways to introduce probing. 
There are many who intuitively object to introducing probing signals inten- 
tionally. It must be remembered that a poorly tuned regulator will give larger 
than necessary deviations in controlled variables. A systematic approach to 
design and implementation of safety networks is an issue of great practical 
relevance. Expert systems may be useful in this context. 
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Multivariable Adaptive Control 


In this book we have focused on single-input, single-output systems, mainly 
to keep the presentation simple, but there has also been much research on 
multivariable adaptive control. Many of the results can be extended, but there 
is one large difficulty. For single-input, single-output systems it is possible 
to find a good canonical form to represent the systems in which the only 
parameter is the order of the system. For multivariable systems it is necessary 
also to know the Kronecker indices to obtain a canonical form. This is difficult 
both in theory and in practice. For special systems such as those found in 
robotics, a suitable structure can often be found by using prior knowledge of 
the system. 

Most adaptive control systems used so far are single-loop control. Coupled 
systems can be obtained by interconnection via the feedforward connection. 
Interesting phenomena can occur when such regulators are used on multi- 
variable systems; analysis of the behavior of such systems is a fascinating 
problem. 


Theoretical Issues 


There are many unresolved theoretical problems in adaptive control. For exam- 
ple, we have no good results on the stability of schemes with gain scheduling. 
Much work is also needed on analysis of convergence rates, On a very funda- 
mental level, there is a need for better averaging theorems. Many results apply 
only to periodic signals. This is natural, since the theory was originally devel- 
oped for nonlinear oscillations. It would be highly desirable to have results for 
more general signal classes. 

Several important problems have arisen in applications. The most impor- 
tant one is the design of proper safety logic; this is currently done in an ad 
hoc fashion. The development is also hampered by the fact that much of the 
information is proprietary, for competitive reasons. 


13.7 CONCLUSIONS 


In this book we have attempted to give our view of the complex field of adaptive 
control. There are many unresolved research issues and many white spots on 
the map of adaptive control. The field is developing rapidly, and new ideas are 
continually popping up. 

Our opinion is that adaptive control is a good tool that a control engineer 
can use on many occasions. We hope that this book will help to spread the use 
of adaptive control and that it may inspire some of you to do research that will 
enhance our understanding of adaptive systems. 
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